STREPTOCOCCUS PNEUMONIAE PROTEINS AND NUCLEIC ACID MOLECULES 

The present invention relates to proteins derived from Streptococcus pneumoniae, 
nucleic acid molecules encoding such proteins, the use of the nucleic acid and/or 
proteins as antigens/immunogens and in detection/diagnosis, as well as methods for 
screening the proteins/nucleic acid sequences as potential anti-microbial targets. 

Streptococcus pneumoniae, commonly referred to as the pneumococcus, is an 
important pathogenic organism. The continuing significance of Streptococcus 
pneumoniae infections in relation to human disease in developing and developed 
countries has been authoritatively reviewed (Siber, G.R., Science, 265: 1385-1387 
(1994)). That indicates that on a global scale this organism is believed to be the 
most common bacterial cause of acute respiratory infections, and is estimated to 
result in 1 million childhood deaths each year, mostly in developing countries 
(Stansfield, S.K., Pediatr. Infect. Dis. J., 6: 622 (1987)). In the USA it has been 
suggested (Breiman et al, Arch. Intern. Med., 150: 1401 (1990)) that the 
pneumococcus is still the most common cause of bacterial pneumonia, and that 
disease rates are particularly high in young children, in the elderly, and in patients 
with predisposing conditions such as asplenia, heart, lung and kidney disease, 
diabetes, alcoholism, or with immunosuppressive disorders, especially AIDS. These 
groups are at higher risk of pneumococcal septicaemia and hence meningitis and 
therefore have a greater risk of dying from pneumococcal infection. The 
pneumococcus is also the leading cause of otitis media and sinusitis, which remain 
prevalent infections in children in developed countries, and which incur substantial 
costs. 

The need for effective preventative strategies against pneumococcal infection is 
highlighted by the recent emergence of penicillin-resistant pneumococci. It has been 
reported that 6.6% of pneumococcal isolates in 13 US hospitals in 12 states were found 



PCT/GB99/02451 

to be resistant to penicillin and some isolates were also resistant to other antibiotics 
including third generation cephalosporins (Schappert, S.M., Vital and Health Statistics 
of the Centres for Disease Control/National Centre for Health Statistics, 214:1 
(1992)). The rates of penicillin resistance can be higher (up to 20%) in some 
hospitals (Breiman et al, J. Am. Med. Assoc., 271: 1831 (1994)). Since the 
development of penicillin resistance among pneumococci is both recent and sudden, 
coming after decades during which penicillin remained an effective treatment, these 
findings are regarded as alarming. 

For the reasons given above, there are compelling grounds for considering 
improvements in the means of preventing, controlling, diagnosing or treating 
pneumococcal diseases. 

Various approaches have been taken in order to provide vaccines for the prevention 
of pneumococcal infections. Difficulties arise for instance in view of the variety of 
serotypes (at least 90) based on the structure of the polysaccharide capsule 
surrounding the organism. Vaccines against individual serotypes are not effective 
against other serotypes and this means that vaccines must include polysaccharide 
antigens from a whole range of serotypes in order to be effective in a majority of 
cases. An additional problem arises because it has been found that the capsular 
polysaccharides (each of which determines the serotype and is the major protective 
antigen) when purified and used as a vaccine do not reliably induce protective 
antibody responses in children under two years of age, the age group which suffers 
the highest incidence of invasive pneumococcal infection and meningitis. 

A modification of the approach using capsule antigens relies on conjugating the 
polysaccharide to a protein in order to derive an enhanced immune response, 
particularly by giving the response T-cell dependent character. This approach has 
been used in the development of a vaccine against Haemophilus influenzae. There 
are issues of cost concerning both the multi-polysaccharide vaccines and those based 
on conjugates. 

A third approach is to look for other antigenic components which offer the potential 
to be vaccine candidates. In the present application we provide a group of protein 
antigens which are secreted/exported proteins. 

Thus, in a first aspect the present invention provides a Streptococcus pneumoniae 
protein or polypeptide having a sequence selected from those shown in Table 2 
herein. 

A protein or polypeptide of the present invention may be provided in substantially pure 
form. For example, it may be provided in a form which is substantially free of other 
proteins. 

In a preferred embodiment, a protein or polypeptide having an amino acid sequence as 
shown in Table 3 is provided. 

The invention encompasses any protein coded for by a nucleic acid sequence as shown 
in Table 1 herein. 

As discussed herein, the proteins and polypeptides of the invention are useful as 
antigenic material. Such material can be "antigenic" and/or "immunogenic". 
Generally, "antigenic" is taken to mean that the protein or polypeptide is capable of 
being used to raise antibodies or indeed is capable of inducing an antibody response in 
a subject. "Immunogenic" is taken to mean that the protein or polypeptide is capable of 
eliciting a protective immune response in a subject. Thus, in the latter case, the protein 
or polypeptide may be capable of generating not only an antibody response but also 
non-antibody based immune responses. 



The skilled person will appreciate that homologues or derivatives of the proteins or 
polypeptides of the invention will also find use in the context of the present invention, 
ie as antigenic/immunogenic material. Thus, for instance proteins or polypeptides 
which include one or more additions, deletions, substitutions or the like are 
encompassed by the present invention. In addition, it may be possible to replace one 
amino acid with another of similar "type", for instance replacing one hydrophobic 
amino acid with another. One can use a program such as the CLUSTAL program to 
compare amino acid sequences. This program compares amino acid sequences and 
finds the optimal alignment by inserting spaces in either sequence as appropriate. It is 
possible to calculate amino acid identity or similarity (identity plus conservation of 
amino acid type) for an optimal alignment. A program like BLASTx will align the 
longest stretch of similar sequences and assign a value to the fit. It is thus possible to 
obtain a comparison where several regions of similarity are found, each having a 
different score. Both types of analysis are contemplated in the present invention. 
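
By way of illustration only, the identity and similarity figures discussed above can be computed for a pair of sequences that have already been aligned (for example by a program such as CLUSTAL). The sketch below is not part of any named program. The grouping of amino acids into "types" and the choice to count gap columns in the denominator are assumptions made for the example; the function name identity_and_similarity is hypothetical.

```python
# Assumed grouping of amino acids by "type"; the text does not define the groups.
SIMILAR_GROUPS = [
    set("AVLIMFWP"),  # hydrophobic (assumed)
    set("STNQYCG"),   # polar, uncharged (assumed)
    set("DE"),        # acidic
    set("KRH"),       # basic
]


def identity_and_similarity(aligned_a, aligned_b, gap="-"):
    """Return (percent identity, percent similarity) for two pre-aligned sequences.

    Similarity is identity plus conservation of amino acid type.
    Columns where only one sequence has a gap count in the denominator
    (an assumed convention); columns that are gaps in both are skipped.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    identical = similar = compared = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == gap or b == gap:
            continue
        if a == b:
            identical += 1
            similar += 1
        elif any(a in group and b in group for group in SIMILAR_GROUPS):
            similar += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * identical / compared, 100.0 * similar / compared


if __name__ == "__main__":
    ident, simil = identity_and_similarity("MKT-LV", "MRTALI")
    print(f"identity {ident:.1f}%, similarity {simil:.1f}%")
```

With these assumptions a homologue could be compared against the 60%, 70%, 80%, 90% and 95% similarity levels mentioned herein.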

In the case of homologues and derivatives, the degree of identity with a protein or 
polypeptide as described herein is less important than that the homologue or derivative 
should retain its antigenicity or immunogenicity to Streptococcus pneumoniae. 
However, suitably, homologues or derivatives having at least 60% similarity (as 
discussed above) with the proteins or polypeptides described herein are provided. 
Preferably, homologues or derivatives having at least 70% similarity, more preferably 
at least 80% similarity are provided. Most preferably, homologues or derivatives 
having at least 90% or even 95% similarity are provided. 

In an alternative approach, the homologues or derivatives could be fusion proteins, 
incorporating moieties which render purification easier, for example by effectively 
tagging the desired protein or polypeptide. It may be necessary to remove the "tag" or 
it may be the case that the fusion protein itself retains sufficient antigenicity to be 
useful. 

In an additional aspect of the invention there are provided antigenic fragments of the 
proteins or polypeptides of the invention, or of homologues or derivatives thereof. 

For fragments of the proteins or polypeptides described herein, or of homologues or 
derivatives thereof, the situation is slightly different. It is well known that it is possible to 
screen an antigenic protein or polypeptide to identify epitopic regions, ie those regions 
which are responsible for the protein or polypeptide's antigenicity or immunogenicity. 
Methods for carrying out such screening are well known in the art. Thus, the fragments 
of the present invention should include one or more such epitopic regions or be 
sufficiently similar to such regions to retain their antigenic/immunogenic properties. 
Thus, for fragments according to the present invention the degree of identity is perhaps 
irrelevant, since they may be 100% identical to a particular part of a protein or 
polypeptide, homologue or derivative as described herein. The key issue, once again, is 
that the fragment retains the antigenic/immunogenic properties. 

Thus, what is important for homologues, derivatives and fragments is that they possess 
at least a degree of the antigenicity/immunogenicity of the protein or polypeptide from 
which they are derived. 

Gene cloning techniques may be used to provide a protein of the invention in 
substantially pure form. These techniques are disclosed, for example, in J. Sambrook 
et al Molecular Cloning 2nd Edition, Cold Spring Harbor Laboratory Press (1989). 
Thus, in a fourth aspect, the present invention provides a nucleic acid molecule 
comprising or consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 1 or their RNA equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

(iv) a sequence which has substantial identity with any of those of (i), (ii) 
and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a 
protein as defined in Table 1. 

In a fifth aspect the present invention provides a nucleic acid molecule comprising or 
consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 4 or their RNA equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

(iv) a sequence which has substantial identity with any of those of (i), (ii) 
and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a protein 
as defined in Table 4. 

The nucleic acid molecules of the invention may include a plurality of such sequences, 
and/or fragments. The skilled person will appreciate that the present invention can 
include novel variants of those particular novel nucleic acid molecules which are 
exemplified herein. Such variants are encompassed by the present invention. These 
may occur in nature, for example because of strain variation. For example, additions, 
substitutions and/or deletions are included. In addition, and particularly when utilising 
microbial expression systems, one may wish to engineer the nucleic acid sequence by 
making use of known preferred codon usage in the particular organism being used for 
expression. Thus, synthetic or non-naturally occurring variants are also included within 
the scope of the invention. 
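
As a purely illustrative sketch of engineering a coding sequence around preferred codon usage, a protein sequence can be back-translated by choosing one codon per amino acid from a preference table for the expression host. The table below is a hypothetical placeholder covering only a few residues and is not measured usage data for any organism; the name back_translate is likewise an assumption.

```python
# Hypothetical preferred-codon table: placeholder choices for illustration only,
# NOT measured codon-usage data for any particular expression host.
# Each codon does encode the listed residue in the standard genetic code.
PREFERRED_CODON = {
    "M": "ATG",
    "K": "AAA",
    "L": "CTG",
    "S": "AGC",
    "E": "GAA",
    "*": "TAA",  # stop
}


def back_translate(protein, preferred=PREFERRED_CODON):
    """Return a DNA sequence encoding `protein` using one preferred codon per residue."""
    codons = []
    for residue in protein.upper():
        if residue not in preferred:
            raise ValueError(f"no preferred codon listed for residue {residue!r}")
        codons.append(preferred[residue])
    return "".join(codons)


if __name__ == "__main__":
    print(back_translate("MKLSE*"))
```

The resulting synthetic sequence codes for the same protein as the natural sequence while differing from it at the nucleotide level.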

The term "RNA equivalent" when used above indicates that a given RNA molecule has 
a sequence which is complementary to that of a given DNA molecule (allowing for the 
fact that in RNA "U" replaces "T" in the genetic code). 

When comparing nucleic acid sequences for the purposes of determining the degree of 
homology or identity one can use programs such as BESTFIT and GAP (both from the 
Wisconsin Genetics Computer Group (GCG) software package). BESTFIT, for 
example, compares two sequences and produces an optimal alignment of the most 
similar segments. GAP enables sequences to be aligned along their whole length and 
finds the optimal alignment by inserting spaces in either sequence as appropriate. 
Suitably, in the context of the present invention, when discussing identity of 
nucleic acid sequences, the comparison is made by alignment of the sequences along 
their whole length. 
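
The whole-length comparison described above can be sketched, for illustration only, with a textbook global alignment (Needleman-Wunsch) followed by a count of identical columns. This is not the GAP program itself. The scoring values (match +1, mismatch -1, gap -1) and the use of the alignment length as the denominator are assumptions made for the example.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-1):
    """Needleman-Wunsch global alignment; returns the two gapped strings."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,      # gap in b
                score[i][j - 1] + gap,      # gap in a
            )
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i -= 1
            j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Percent of alignment columns that are identical, over the whole length."""
    aligned_a, aligned_b = global_align(a.upper(), b.upper())
    matches = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)


if __name__ == "__main__":
    print(global_align("ATGC", "ATC"))
    print(percent_identity("ATGC", "ATC"))
```

A figure obtained in this way is what would be compared against an identity threshold.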

Preferably, sequences which have substantial identity have at least 50% sequence 
identity, desirably at least 75% sequence identity and more desirably at least 90% or at 
least 95% sequence identity with said sequences. In some cases the sequence identity 
may be 99% or above. 

Desirably, the term "substantial identity" indicates that said sequence has a greater 
degree of identity with any of the sequences described herein than with prior art nucleic 
acid sequences. 

It should however be noted that where a nucleic acid sequence of the present invention 
codes for at least part of a novel gene product the present invention includes within its 
scope all possible sequences coding for the gene product or for a novel part thereof. 

The nucleic acid molecule may be in isolated or recombinant form. It may be 
incorporated into a vector and the vector may be incorporated into a host. Such vectors 
and suitable hosts form yet further aspects of the present invention. 

Therefore, for example, by using probes based upon the nucleic acid sequences 
provided herein, genes in Streptococcus pneumoniae can be identified. They can then 
be excised using restriction enzymes and cloned into a vector. The vector can be 
introduced into a suitable host for expression. 

Nucleic acid molecules of the present invention may be obtained from S.pneumoniae by 
the use of appropriate probes complementary to part of the sequences of the nucleic 
acid molecules. Restriction enzymes or sonication techniques can be used to obtain 
appropriately sized fragments for probing. 

Alternatively PCR techniques may be used to amplify a desired nucleic acid sequence. 
Thus the sequence data provided herein can be used to design two primers for use in 
PCR so that a desired sequence, including whole genes or fragments thereof, can be 
targeted and then amplified to a high degree. One primer will normally show a high 
degree of specificity for a first sequence located on one strand of a DNA molecule, and 
the other primer will normally show a high degree of specificity for a second sequence 
located on the complementary strand of the DNA sequence and being spaced from the 
complementary sequence to the first sequence. 

Typically primers will be at least 15-25 nucleotides long. 
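
The relationship between the two primers can be illustrated with a minimal in-silico sketch: the forward primer matches the given strand, the reverse primer matches the complementary strand, and the product spans both sites. The toy template and the 6-nucleotide primers below are invented for the example (real primers would typically be longer, as stated above), and the function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def amplicon(template, forward, reverse):
    """Return the region of `template` amplified by the primer pair, or None.

    `forward` is sought on the given strand. `reverse` anneals to that strand,
    so its reverse complement is sought downstream of the forward primer site.
    """
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    site = reverse_complement(reverse)
    end = template.find(site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(site)]


if __name__ == "__main__":
    toy_template = "GGGATGCCCAAATTTGGGCCCTAGAAA"  # invented sequence
    print(amplicon(toy_template, "ATGCCC", "CTAGGG"))
```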

As a further alternative chemical synthesis may be used. This may be automated. 
Relatively short sequences may be chemically synthesised and ligated together to 
provide a longer sequence. 

In yet a further aspect the present invention provides an immunogenic/antigenic 
composition comprising one or more proteins or polypeptides selected from those 
whose sequences are shown in Tables 2-4, or homologues or derivatives thereof, 
and/or fragments of any of these. In preferred embodiments, the 
immunogenic/antigenic composition is a vaccine or is for use in a diagnostic assay. 

In the case of vaccines suitable additional excipients, diluents, adjuvants or the like 
may be included. Numerous examples of these are well known in the art. 

It is also possible to utilise the nucleic acid sequences shown in Table 1 in the 
preparation of so-called DNA vaccines. Thus, the invention also provides a vaccine 
composition comprising one or more nucleic acid sequences as defined herein. The 
use of such DNA vaccines is described in the art. See for instance, Donnelly et al, 
Ann. Rev. Immunol., 15:617-648 (1997). 

As already discussed herein the proteins or polypeptides described herein, their 
homologues or derivatives, and/or fragments of any of these, can be used in methods 
of detecting/diagnosing S.pneumoniae. Such methods can be based on the detection 
of antibodies against such proteins which may be present in a subject. Therefore the 
present invention provides a method for the detection/diagnosis of S.pneumoniae 
which comprises the step of bringing into contact a sample to be tested with at least 
one protein, or homologue, derivative or fragment thereof, as described herein. 

Suitably, the sample is a biological sample, such as a tissue sample or a sample of 
blood or saliva obtained from a subject to be tested. 

In an alternative approach, the proteins described herein, or homologues, derivatives 
and/or fragments thereof, can be used to raise antibodies, which in turn can be used 
to detect the antigens, and hence S.pneumoniae. Such antibodies form another aspect 
of the invention. Antibodies within the scope of the present invention may be 
monoclonal or polyclonal. 

Polyclonal antibodies can be raised by stimulating their production in a suitable animal 
host (e.g. a mouse, rat, guinea pig, rabbit, sheep, goat or monkey) when a protein as 
described herein, or a homologue, derivative or fragment thereof, is injected into the 
animal. If desired, an adjuvant may be administered together with the protein. 
Well-known adjuvants include Freund's adjuvant (complete and incomplete) and aluminium 
hydroxide. The antibodies can then be purified by virtue of their binding to a protein as 
described herein. 

Monoclonal antibodies can be produced from hybridomas. These can be formed by 
fusing myeloma cells and spleen cells which produce the desired antibody in order to 
form an immortal cell line. Thus the well-known Kohler & Milstein technique (Nature 
256 (1975)) or subsequent variations upon this technique can be used. 

Techniques for producing monoclonal and polyclonal antibodies that bind to a 
particular polypeptide/protein are now well developed in the art. They are discussed in 
standard immunology textbooks, for example in Roitt et al, Immunology second edition 
(1989), Churchill Livingstone, London. 

In addition to whole antibodies, the present invention includes derivatives thereof which 
are capable of binding to proteins etc as described herein. Thus the present invention 
includes antibody fragments and synthetic constructs. Examples of antibody fragments 
and synthetic constructs are given by Dougall et al in Tibtech 12 372-379 (September 
1994). 

Antibody fragments include, for example, Fab, F(ab')2 and Fv fragments (these are 
discussed in Roitt et al [supra]). Fv fragments can be modified 
to produce a synthetic construct known as a single chain Fv (scFv) molecule. This 
includes a peptide linker covalently joining VH and VL regions, which contributes to the 
stability of the molecule. Other synthetic constructs that can be used include CDR 
peptides. These are synthetic peptides comprising antigen-binding determinants. 
Peptide mimetics may also be used. These molecules are usually conformationally 
restricted organic rings that mimic the structure of a CDR loop and that include 
antigen-interactive side chains. 

Synthetic constructs include chimaeric molecules. Thus, for example, humanised (or 
primatised) antibodies or derivatives thereof are within the scope of the present 
invention. An example of a humanised antibody is an antibody having human 
framework regions, but rodent hypervariable regions. Ways of producing chimaeric 
antibodies are discussed for example by Morrison et al in PNAS, 81, 6851-6855 (1984) 
and by Takeda et al in Nature, 314, 452-454 (1985). 

Synthetic constructs also include molecules comprising an additional moiety that 
provides the molecule with some desirable property in addition to antigen binding. For 
example the moiety may be a label (e.g. a fluorescent or radioactive label). 
Alternatively, it may be a pharmaceutically active agent. 

Antibodies, or derivatives thereof, find use in detection/diagnosis of S.pneumoniae. 
Thus, in another aspect the present invention provides a method for the 
detection/diagnosis of S.pneumoniae which comprises the step of bringing into contact 
a sample to be tested and antibodies capable of binding to one or more proteins 
described herein, or to homologues, derivatives and/or fragments thereof. 

In addition, so-called "Affibodies" may be utilised. These are binding proteins 
selected from combinatorial libraries of an alpha-helical bacterial receptor domain 
(Nord et al). Thus, small protein domains capable of specific binding to different 
target proteins can be selected using combinatorial approaches. 

It will also be clear that the nucleic acid sequences described herein may be used to 
detect/diagnose S.pneumoniae. Thus, in yet a further aspect, the present invention 
provides a method for the detection/diagnosis of S.pneumoniae which comprises the 
step of bringing into contact a sample to be tested with at least one nucleic acid 
sequence as described herein. Suitably, the sample is a biological sample, such as a 
tissue sample or a sample of blood or saliva obtained from a subject to be tested. 
Such samples may be pre-treated before being used in the methods of the invention. 
Thus, for example, a sample may be treated to extract DNA. Then, DNA probes 
based on the nucleic acid sequences described herein (ie usually fragments of such 
sequences) may be used to detect nucleic acid from S.pneumoniae. 

In additional aspects, the present invention provides: 

(a) a method of vaccinating a subject against S.pneumoniae which comprises the 
step of administering to a subject a protein or polypeptide of the invention, or a 
derivative, homologue or fragment thereof, or an immunogenic composition of the 
invention; 

(b) a method of vaccinating a subject against S.pneumoniae which comprises the 
step of administering to a subject a nucleic acid molecule as defined herein; 

(c) a method for the prophylaxis or treatment of S.pneumoniae infection which 
comprises the step of administering to a subject a protein or polypeptide of the 
invention, or a derivative, homologue or fragment thereof, or an immunogenic 
composition of the invention; 

(d) a method for the prophylaxis or treatment of S.pneumoniae infection which 
comprises the step of administering to a subject a nucleic acid molecule as defined 
herein; 

(e) a kit for use in detecting/diagnosing S.pneumoniae infection comprising one 



WO 00/06737 






or more proteins or polypeptides of the invention, or homologues, derivatives or 
fragments thereof, or an antigenic composition of the invention; and 

(f) a kit for use in detecting/diagnosing S. pneumoniae infection comprising one 
or more nucleic acid molecules as defined herein. 

Given that we have identified a group of important proteins, such proteins are 
potential targets for anti-microbial therapy. It is necessary, however, to determine 
whether each individual protein is essential for the organism's viability. Thus, the 
present invention also provides a method of determining whether a protein or 
polypeptide as described herein represents a potential anti-microbial target which 
comprises inactivating said protein and determining whether S.pneumoniae is still 
viable, in vitro or in vivo. 

A suitable method for inactivating the protein is to effect selected gene knockouts, ie 
prevent expression of the protein and determine whether this results in a lethal 
change. Suitable methods for carrying out such gene knockouts are described in Li et 
al P.N.A.S., 94:13251-13256 (1997). 

In a final aspect the present invention provides the use of an agent capable of 
antagonising, inhibiting or otherwise interfering with the function or expression of a 
protein or polypeptide of the invention in the manufacture of a medicament for use in 
the treatment or prophylaxis of S.pneumoniae infection. 

The invention will now be described with reference to the following examples, 
which should not be construed as in any way limiting the invention. The examples 
refer to the figures in which: 

Figure 1: shows the results of various DNA vaccine trials; and 
Figure 2: shows the results of further DNA vaccine trials. 

EXAMPLE 1 

The genome sequencing of Streptococcus pneumoniae type 4 is in progress at the 
Institute for Genomic Research (TIGR, Rockville, MD, USA). Up to now, the 
whole sequence has not been completed or published. On 21st November 1997, the 
TIGR centre released some DNA sequences as contigs which are not accurate 
reflections of the finished sequence. These contigs can be downloaded from their 
website (www@tigr.org). We downloaded these contigs and created a local database 
using the application GCGToBLAST (Wisconsin Package Version 9.1, Genetics 
Computer Group (GCG), Madison, USA). This database can be searched with the 
FastA and TfastA procedures (using the method of Pearson and Lipman (PNAS USA, 
85:2444-2448 (1988))). 

Using FastA and TfastA procedures, the local pneumococcus database was searched 
for putative leader sequence or anchor sequence features. Relevant sequences were 
used to interrogate for comparative novel sequences. These were: 

(i) already described leader sequences of Streptococcus pneumoniae (from 
proteins NanA, NanB, LytA, PapA, PcpA, PsaA and PspA); 

(ii) the leader sequence of Usp45, a secreted protein from Lactococcus lactis; 

(iii) new hypothetical leader sequences derived from the searches in (i) and 
(ii); 

(iv) the anchor motif LPxTG, a feature common to many Gram-positive 
bacterial surface proteins which are anchored by a mechanism involving the 
Sortase complex proteins. 
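
For illustration, a search for the LPxTG anchor motif of item (iv) in a translated sequence can be sketched with a simple pattern match in which "x" stands for any residue. This is not the FastA/TfastA procedure used in the work described; the toy protein sequence and the function name find_lpxtg are invented for the example.

```python
import re

# "x" in LPxTG is taken to mean any single residue.
LPXTG = re.compile(r"LP.TG")


def find_lpxtg(protein):
    """Return (0-based position, matched text) for each LPxTG motif found."""
    return [(m.start(), m.group()) for m in LPXTG.finditer(protein.upper())]


if __name__ == "__main__":
    toy_protein = "MKKLPETGAAALPATGEE"  # invented sequence
    print(find_lpxtg(toy_protein))
```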

Provided below is an example of this approach, with reference to the sequences 
derived from the database (see Table 1). 

[Figure: search scheme; the original layout is not recoverable. Labels shown: Usp45, 
NanA, PapA, PsaA, PspA, PrtP. Sequence identifiers shown: 4121.1, 4101.3, 4172.4, 
4106.1, 4346.1, 4194.1, 4172.2, 4147.2, 4191.6, 4170.8, 4175.3, 4137.3, 4193.1, 
4122.1, 4187.2, 4188.1, 4102.1, 4136.1, 4127.1, 4191.1, 4119.1, 4119.2.] 



The protein leader sequences of different known exported proteins were used as a 
starting point for a search of the local pneumococcus database described above. The 
hypothetical proteins found with this search were then submitted to a Blast search in 
general databases such as EMBL, Swissprot etc. Proteins remaining unknown in the 
pneumococcus were kept and annotated. The search was then performed again using the 
new potential protein leader sequence as a probe, using the TfastA procedure. 
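
The iterative character of this procedure (search with known leader sequences, keep the unknown hits, then search again with each new hit as a probe) can be sketched as follows. The sketch is illustrative only: the callables search and is_known are hypothetical stand-ins for the TfastA search of the local database and the Blast check against general databases, and the identifiers in the usage example are invented.

```python
from collections import deque


def iterative_search(seed_probes, search, is_known):
    """Repeat the search with each newly found unknown sequence as a probe.

    search(probe)  -> iterable of hit identifiers (stand-in for a TfastA search)
    is_known(hit)  -> True if the hit matches a known protein (stand-in for Blast)
    Returns the unknown hits in the order in which they were found.
    """
    seen = set(seed_probes)
    queue = deque(seed_probes)
    novel = []
    while queue:
        probe = queue.popleft()
        for hit in search(probe):
            if hit in seen:
                continue
            seen.add(hit)
            if not is_known(hit):
                novel.append(hit)
                queue.append(hit)  # new hit becomes a probe in a later round
    return novel


if __name__ == "__main__":
    # Invented toy data: which identifiers each probe retrieves.
    toy_hits = {"seed": ["seqA", "seqB"], "seqA": ["seqC"], "seqC": ["seqA"]}
    toy_known = {"seqB"}
    print(iterative_search(["seed"],
                           lambda p: toy_hits.get(p, []),
                           lambda h: h in toy_known))
```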

Example 2: DNA vaccine trials 

pcDNA3.1+ as a DNA vaccine vector 

The vector chosen for use as a DNA vaccine vector was pcDNA3.1 (Invitrogen) 
(actually pcDNA3.1+; the forward orientation was used in all cases, but the vector may be 
referred to as pcDNA3.1 from here on). This vector has been widely and successfully 
employed as a host vector to test vaccine candidate genes to give protection against 
pathogens in the literature (Zhang et al., Kurar and Splitter, Anderson et al.). The 
vector was designed for high-level stable and non-replicative transient expression in 
mammalian cells. pcDNA3.1 contains the ColE1 origin of replication, which allows 
convenient high-copy number replication and growth in E. coli. This in turn allows 
rapid and efficient cloning and testing of many genes. The pcDNA3.1 vector has a 
large number of cloning sites and also contains the gene encoding ampicillin 
resistance to aid in cloning selection and the human cytomegalovirus (CMV) 
immediate-early promoter/enhancer, which permits efficient, high-level expression of 
the recombinant protein. The CMV promoter is a strong viral promoter in a wide 
range of cell types including both muscle and immune (antigen presenting) cells. 
This is important for optimal immune response as it remains unknown which 
cell types are most important in generating a protective response in vivo. A T7 
promoter upstream of the multiple cloning site affords efficient expression of the 
modified insert of interest and allows in vitro transcription of a cloned gene in 
the sense orientation. 

Zhang, D., Yang, X., Berry, J., Shen, C., McClarty, G. and Brunham, R.C. (1997) 
"DNA vaccination with the major outer-membrane protein genes induces acquired 
immunity to Chlamydia trachomatis (mouse pneumonitis) infection". Infection and 
Immunity, 176, 1035-40. 

Kurar, E. and Splitter, G.A. (1997) "Nucleic acid vaccination of Brucella abortus 
ribosomal L7/L12 gene elicits immune response". Vaccine, 15, 1851-57. 

Anderson, R., Gao, X.-M., Papakonstantinopoulou, A., Roberts, M. and Dougan, 
G. (1996) "Immune response in mice following immunisation with DNA encoding 
fragment C of tetanus toxin". Infection and Immunity, 64, 3168-3173. 

Preparation of DNA vaccines 

Oligonucleotide primers were designed for each individual gene of interest derived 
using the LEEP system. Each gene was examined thoroughly, and where possible, 
primers were designed such that they targeted that portion of the gene thought to 
encode only the mature portion of the protein. It was hoped that expressing 
those sequences that encode only the mature portion of a target protein would 
facilitate its correct folding when expressed in mammalian cells. For example, in the 
majority of cases primers were designed such that putative N-terminal signal peptide 
sequences would not be included in the final amplification product to be cloned into 
the pcDNA3.1 expression vector. The signal peptide directs the polypeptide 
precursor to the cell membrane via the protein export pathway where it is normally 
cleaved off by signal peptidase I (or signal peptidase II if a lipoprotein). Hence the 
signal peptide does not make up any part of the mature protein whether it be 
displayed on the surface of the bacteria or secreted. Where an N-terminal 
leader peptide sequence was not immediately obvious, primers were designed to 
target the whole of the gene sequence for cloning and ultimately, expression in 
pcDNA3.1. 



Having said that, however, other additional features of proteins may also affect the 
expression and presentation of a soluble protein. DNA sequences encoding such 
features in the genes encoding the proteins of interest were excluded during the 
design of oligonucleotides. These features included: 

1. LPXTG cell wall anchoring motifs. 

2. LXXC lipoprotein attachment sites. 

3. Hydrophobic C-terminal domain. 

4. Where no N-terminal signal peptide or LXXC was present the start codon was 
excluded. 

5. Where no hydrophobic C-terminal domain or LPXTG motif was present the stop 
codon was removed. 

Appropriate PCR primers were designed for each gene of interest, and any and all of 
the regions encoding the above features were removed from the gene when designing 
these primers. The primers were designed with the appropriate restriction enzyme 
site followed by a conserved Kozak nucleotide sequence (in all cases GCCACC was 
used; the Kozak sequence facilitates the recognition of initiator sequences by 
eukaryotic ribosomes) and an ATG start codon upstream of the insert of the gene of 
interest. For example, a forward primer using a BamHI site would begin 
GCGGGATCCGCCACCATG, followed by a small section of the 5' end of the gene of 
interest. The reverse primer was designed to be compatible with the forward primer, 
with a NotI restriction site at the 5' end in all cases (this site is 
TTGCGGCCGC). 
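
The primer layout described above can be sketched in a few lines of Python. This is 
a minimal illustration under stated assumptions, not the tool used to design the 
primers: the function names are hypothetical, the prefixes are the ones quoted in 
the text, and the choice of how many gene-specific bases to append (and any melting 
temperature matching between the two primers) is left to the caller. 

```python
KOZAK = "GCCACC"             # Kozak sequence stated in the text
BAMHI_PREFIX = "GCGGGATCC"   # leading bases plus BamHI site, from the example in the text
NOTI_PREFIX = "TTGCGGCCGC"   # NotI site with leading TT, as given in the text

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def forward_primer(gene_5prime, site_prefix=BAMHI_PREFIX):
    """Restriction site + Kozak sequence + ATG + start of the retained coding region."""
    return site_prefix + KOZAK + "ATG" + gene_5prime.upper()


def reverse_primer(gene_3prime, site_prefix=NOTI_PREFIX):
    """Restriction site + reverse complement of the end of the retained coding region."""
    return site_prefix + reverse_complement(gene_3prime)


if __name__ == "__main__":
    # Hypothetical gene fragments, for illustration only.
    print(forward_primer("TCTTCTAATGAATCTGCC"))
    print(reverse_primer("AAGTCTTTACAGATATCC"))
```

Keeping the site, Kozak sequence and start codon as separate constants makes it easy 
to see that every forward primer shares the same fixed 5' portion and differs only 
in the gene-specific section. 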



PCR primers 

The following PCR primers were designed and used to amplify the truncated genes 
of interest. 



ID210 

Forward Primer 5' CGGATCCGCCACCATGTCTTCTAATGAATCTGCCGATG 3' 
Reverse Primer 5' TTGCGGCCGCCTGTTTAGATTGGATATCTGTAAAGACTT 3' 

4172.5 

Forward Primer 5' CGCGGATCCGCCACCATGGATTTTCCTTCAAATTTGGAGG 3' 
Reverse Primer 5' TTGCGGCCGCACCGTACTGGCTGCTGACT 3' 

ID211 

Forward Primer 5' CGGATCCGCCACCATGAGTGAGATCAAAATTATTAACGC 3' 
Reverse Primer 5' TTGCGGCCGCCGTTCCATGGTTGACTCCT 3' 

4197.4 

Forward Primer 5' CGCGGATCCGCCACCATGTGGGACATATTGGTGGAAAC 3' 
Reverse Primer 5' TTGCGGCCGCTTCACTTGAGCAAACTGAATCC 3' 

4122.1 

Forward Primer 5' CGCGGATCCGCCACCATGTCACAAGAAAAAACAAAAAATGAA 3' 
Reverse Primer 5' TTGCGGCCGCATCGACGTAGTCTCCGCC 3' 

4126.7 

Forward Primer 5' CGCGGATCCGCCACCATGCTGGTTGGAACTTTCTACTATCAAT 3' 
Reverse Primer 5' TTGCGGCCGCAACTTTCGTCCCTTTTTGG 3' 

4188.11 

Forward Primer 5' CGCGGATCCGCCACCATGGGCAATTCTGGCGGAA 3' 
Reverse Primer 5' TTGCGGCCGCTTGTTTCATAGCTTTTTTGATTGTT 3' 

ID209 

Forward Primer 5' CGCGGATCCGCCACCATGCTATTGATACGAAATGCAGGG 3' 
Reverse Primer 5' TTGCGGCCGCAACATAATCTAGTAAATAAGCGTAGCC 3' 

ID215 

Forward Primer 5' CGCGGATCCGCCACCATGACGGCGACGAATTTTC 3' 
Reverse Primer 5' TTGCGGCCGCTTAATTCGTTTTTGAACTAGTTGCT 3' 

4170.4 

Forward Primer 5' CGCGGATCCGCCACCATGGCTGTTTTTCTTCGCTATCATG 3' 
Reverse Primer 5' TTGCGGCCGCTTTCTTCAACAAACCTTGTTCTTG 3' 

4193.1 

Forward Primer 5' CGCGGATCCGCCACCATGGGTAACCGCTCTTCTCGTAAC 3' 
Reverse Primer 5' TTGCGGCCGCGCTTCCATCAAGGATTTTAGC 3' 

Cloning 

The insert, along with the flanking features described above, was amplified using 
PCR against a template of genomic DNA isolated from type 4 S. pneumoniae strain 
11886 obtained from the National Collection of Type Cultures. The PCR product was 
cut with the appropriate restriction enzymes and cloned into the multiple cloning 
site of pcDNA3.1 using conventional molecular biological techniques. Suitably 
mapped clones of the genes of interest were cultured and the plasmids isolated on a 
large scale (> 1.5 mg) using Plasmid Mega Kits (Qiagen). Successful cloning and 
maintenance of genes was confirmed by restriction mapping and by sequencing ~700 
base pairs through the 5' cloning junction of each large scale preparation of each 
construct. 



Strain validation 

The type 4 strain used in the cloning and challenge methods is the strain from 
which the S. pneumoniae genome was sequenced. A freeze dried ampoule of a 
homogeneous laboratory strain of type 4 S. pneumoniae, strain NCTC 11886, was 
obtained from the National Collection of Type Cultures. The ampoule was opened and 
the culture resuspended in 0.5 ml of tryptic soy broth (0.5% glucose, 5% blood). 
The suspension was subcultured into 10 ml tryptic soy broth (0.5% glucose, 5% 
blood) and incubated statically overnight at 37°C. This culture was streaked on to 
5% blood agar plates, to check for contaminants and confirm viability, and on to 
blood agar slopes; the rest of the culture was used to make 20% glycerol stocks. 
The slopes were sent to the Public Health Laboratory Service, where the type 4 
serotype was confirmed. 

A glycerol stock of NCTC 11886 was streaked on a 5% blood agar plate and incubated 
overnight in a CO2 gas jar at 37°C. Fresh streaks were made and optochin 
sensitivity was confirmed. 

Pneumococcal challenge 

A standard inoculum of type 4 S. pneumoniae was prepared by passaging a culture of 
pneumococcus once through mice, harvesting from the blood of infected animals, and 
growing it up in broth to a predetermined viable count of around 10^9 cfu/ml before 
freezing down. The preparation is set out in the flow chart below. 

Streak pneumococcal culture and confirm identity 
        | 
        V 
Grow over-night culture from 4-5 colonies on plate above 
        | 
        V 
Animal passage pneumococcal culture 
(i.p. injection of cardiac bleed to harvest) 
        | 
        V 
Grow over-night culture from animal passaged pneumococcus 
        | 
        V 
Grow day culture (to pre-determined optical density) from over-night of animal 
passage and freeze down at -70 °C - This is the standard inoculum 
        | 
        V 
Thaw one aliquot of standard inoculum to viable count 
        | 
        V 
Use standard inoculum to determine effective dose (called Virulence Testing) 
        | 
        V 
All subsequent challenges - use standard inoculum to effective dose 

An aliquot of standard inoculum was diluted 500x in PBS and used to inoculate the 
mice. 

Mice were lightly anaesthetised using halothane and then a dose of 1.4 x 10^5 cfu 
of pneumococcus was applied to the nose of each mouse. The uptake was facilitated 
by the normal breathing of the mouse, which was left to recover on its back. 
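
The dilution arithmetic behind such a challenge dose can be sketched as follows. 
This is illustrative only: the function names are hypothetical, the stock 
concentration is the nominal "around 10^9 cfu/ml" figure rather than a measured 
viable count, and the volume applied to each mouse is not stated in the text, so 
the volume computed here is simply what the nominal figures would imply. 

```python
def diluted_concentration(stock_cfu_per_ml, dilution_factor):
    """Concentration (cfu/ml) after diluting a stock by the given factor."""
    return stock_cfu_per_ml / dilution_factor


def volume_for_dose(dose_cfu, conc_cfu_per_ml):
    """Volume (ml) of suspension that contains the requested number of cfu."""
    return dose_cfu / conc_cfu_per_ml


if __name__ == "__main__":
    nominal_stock = 1e9   # assumption: nominal count of the frozen standard inoculum
    conc = diluted_concentration(nominal_stock, 500)   # 500x dilution in PBS
    volume_ml = volume_for_dose(1.4e5, conc)           # dose quoted in the text
    print(f"{conc:.2e} cfu/ml after dilution, {volume_ml * 1000:.0f} microlitres per dose")
```

In practice the viable count of the thawed aliquot would be confirmed by plating, 
and the figures above would be replaced by the measured values. 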

S. pneumoniae vaccine trials 

Vaccine trials in mice were carried out by the administration of DNA to 6 week old 
CBA/ca mice (Harlan, UK). Mice to be vaccinated were divided into groups of six 
and each group was immunised with recombinant pcDNA3.1+ plasmid DNA containing a 
specific target-gene sequence of interest. A total of 100 µg of DNA in Dulbecco's 
PBS (Sigma) was injected intramuscularly into the tibialis anterior muscle of both 
legs (50 µl in each leg). A boost was carried out using the same procedure 4 weeks 
later. For comparison, control groups were included in all vaccine trials. These 
control groups were either unvaccinated animals or those administered 
non-recombinant pcDNA3.1+ DNA (sham vaccinated) only, using the same time course 
described above. 

Three weeks after the second immunisation, all groups of mice were challenged 
intranasally with a lethal dose of S. pneumoniae serotype 4 (strain NCTC 11886). 
The number of bacteria administered was monitored by plating serial dilutions of 
the inoculum on 5% blood agar plates. A problem with intranasal inoculation is that 
in some mice the inoculum bubbles out of the nostrils; this has been noted in the 
results table and taken into account in the calculations. A less obvious problem is 
that a certain amount of the inoculum for each mouse may be swallowed. It is 
assumed that this amount will be the same for each mouse and will average out over 
the course of inoculations. However, the sample sizes that have been used are small 
and this problem may have significant effects in some experiments. 

All mice remaining after the challenge were killed 3 or 4 days after infection. 
During the infection process, challenged mice were monitored for the development of 
symptoms associated with the onset of S. pneumoniae-induced disease. Typical 
symptoms, in approximate order, included piloerection, an increasingly hunched 
posture, discharge from the eyes, increased lethargy and reluctance to move. The 
latter symptoms usually coincided with the development of a moribund state, at 
which stage the mice were culled to prevent further suffering. These mice were 
deemed to be very close to death, and the time of culling was used to determine a 
survival time for statistical analysis. Where mice were found dead, the survival 
time was taken as the last time point when the mouse was monitored alive. 

Interpretation of Results 

A positive result was taken as any DNA sequence that was cloned and used in 
challenge experiments as described above and which gave protection against that 
challenge. Protection was taken as those DNA sequences that gave statistically 
significant protection (to a 95% confidence level (p<0.05)), and also those which 
were marginal or close to significant using the Mann-Whitney test, or which showed 
some protective features, for example where there were one or more outlying mice or 
where the time to the first death was prolonged. It is acceptable to allow marginal 
or non-significant results to be considered as potential positives when it is 
considered that the clarity of some of the results may be clouded by the problems 
associated with the administration of intranasal infections. 
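
For readers unfamiliar with the Mann-Whitney comparison of survival times, a 
minimal pure-Python sketch is given below. It is an assumption-laden illustration, 
not the analysis software used here: the function names and the survival times in 
the example are hypothetical, and the text does not state whether a one- or 
two-sided test was applied. The sketch computes the U statistic with mid-ranks for 
ties and an exact two-sided p-value by enumerating every way of splitting the pooled 
animals into two groups, which is feasible for groups of six. 

```python
from itertools import combinations


def midranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = average
        i = j + 1
    return ranks


def mann_whitney_exact(group_a, group_b):
    """Return (U for group_a, exact two-sided p-value by full enumeration)."""
    pooled = list(group_a) + list(group_b)
    ranks = midranks(pooled)
    n1, n2 = len(group_a), len(group_b)

    def u_for(indices):
        return sum(ranks[i] for i in indices) - n1 * (n1 + 1) / 2

    u_obs = u_for(range(n1))
    centre = n1 * n2 / 2          # expected U under the null hypothesis
    extreme = total = 0
    for indices in combinations(range(n1 + n2), n1):
        total += 1
        if abs(u_for(indices) - centre) >= abs(u_obs - centre) - 1e-9:
            extreme += 1
    return u_obs, extreme / total


if __name__ == "__main__":
    # Hypothetical survival times in hours, for illustration only.
    vaccinated = [60.0, 72.5, 72.5, 80.0, 96.0, 96.0]
    unvaccinated = [48.0, 50.5, 52.0, 55.0, 60.0, 66.0]
    u, p = mann_whitney_exact(vaccinated, unvaccinated)
    print(f"U = {u}, two-sided p = {p:.4f}")
```

With only six animals per group the set of attainable p-values is coarse, which is 
one reason a result can sit just above the 0.05 threshold despite a visible shift 
in survival times. 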



WO 00/06737 



PCT/GB99/02451 




Table 1 



4101.1 

ATGGAAGAGTTAGTGACCTTAGATTGTTTGTTTATTGACAGAACTAAGATTGAAGCCAATG CCAAC AAGTATAGTT 
5 TTGTGTGGAAGAAAACGACAGAGAAATTCTCCGCCAAACTTCAAGAACAGATACAGGTCTATTTTCAAG 

TCACTCCCCTTCTGATTAAATATGCCATGTTTGATAAGAAACAAAAGAGAGGGTATAAAGAGTC AGCT AAAAACT 
TAGCGAATTGGCACTATAATGACAAGGAGGATAGCTACACACATCCTGATGGCTGGTATTATCGTTTTCACCATAC 
C AAAT ATCAG AAAAC AC AG AC AG A CITTC AACAAG AAATC AAGGTTT ACTACG CCG ACG AA C CTG A ATC AG CCCC 
TCAAAAGGGACTGTATATGAACGAACGCTATCAAAACTTGAAAGCTAAAGAATGTCAGGCGCTTTTATCT 
10 AGGTAGACAGATTTTCGCTCAACGCAAGATTGATGTGGAACCTGTCmGGGCAGATAAAGGCT^ 

AAGAGATGTAATCTGAGAGGGAAGCGTCAAGTGAGAATTGACATGGGATTGGTACTTATGGCCAATAACCTCCTA 
AAATATAGTAAAATGAAATAA 

4101.3 

15 ATGGGGAAAGGCCATTGGAATCGGAAAAGAGTTTATAGCATTCGTAAGTTTGCTGTGGGAGCTTGCTCAGTAATG 
ATTGGGACTrGTGCAGTTTTATTAGGAGGAAATATAGCTGGAGAATCTGTAGTTTATGCGGATC 
CTCATACTGCTGAGAAACCTAAAGAGGAAAAAATGATAGTAGAAGAAAAGGCrGATAAAGCITTGGAAACrAAA 
AATATAGTTGAAAGGACAGAACAAAGTGAACCTAGTTCAACTGAGGCTATTGCATCn'GAGAAGAAAGAAGATGA^ 
GCCGTAACrCCAAAAGAGGAAAAAGTGTCTGCTAAACCGGAAGAAAAAGCrCCAAGGATAGAATCACAAGCTTC 

20 AAATCAAGAAAAACCGCTCAAGGAAGATGCTAAAGCTGTAACAAATGAAGAAGTGAATCAAATGATTGAAGACA 
GGAAAGTGGATTTTAATCAAAATTGGTACTTTAAACTCAATGCAAATTCTAAGGAA 

ACGTATCTACGTGGAAAAAATTAGATTTACCGTATGACTGGAGTATCTTTAACGATTTCGATCATGAATCTCCTGC 
ACAAAATGAAGGTGGACAGCTCAACGGTGGGGAAGCTTGGTATCGCAAGACTrTCAAACTAGATGAAAAAGACCT 
C AAG AAAAATGTTCG CCTTACTTTTG ATG G CGTCT AC ATGG ATTCTC AAGTTT ATGTC AATGGTCAGTTAGTGGGG 

25 CATTATCCAAATGGTTATAACCAGTTCTCATATGATATCACCAAATACCTTCAAAAAGATGGTCGTGAGAAT GTGA 
TTGCTGTCCATGCAGTCAACAAACAGCCAAGTAGCCGTTGGTATTCAGGAAGTGGTATCTATCGTGATGTGACTTT 
ACAAGTGACAGATAAGGTGCATGTTGAGAAAAATGGGACAACTATTTTAACACCAAAACTTGAAGAACAACAACA 
TGGCAAGGTTGAAACTCATGTGACCAGCAAAATCGTCAATACGGACGACAAAGACCATGAACTTGTAGCCGAATA 
TCAAATCGTTGAACGAGGTGGTCATGCTGTAACAGGCnTAGTTCGTACAGCGAGTCGTACCTTAAAAGCACATGA 

30 ATCAACAAG CCTAG ATG CG ATTTT AG AAGTTG AAAG ACCAAAACTCTG G ACTGTTTTAAATG ACAAACCTGC CTTG 

TACGAATTGATTACGCGTGTTTACCGTGACGGTCAATTGGTTGATGCTAAGA^ 

ATCACTGGACTCCAAATGAAGGTTTCTCmGAATGGTGAACGTATTAAATTCCATGGAGTATCCTTGCACCACG^ 
CCATGGGGCGCTTGGAGCAGAAGAAAACTATAAAGCAGAATATCGCCGTCTCAAACAAATGAAGGAGATGGGAG 
TTAACTCCATCCGTACAACCCACAACCCTGCTAGTGAGCAAA 

35 TCAGGAAGAGGCCTTTGATACGTGGTATGGTGGCAAGAAACCnTATGACrATGGACGTTTCITTGAAAAAG 

CACTCACCCAGAAGCTCGAAAAGGTGAAAAATGGTCTGATTTTGACCTACGTACCATGGTCGAA AGAG GCAAAAA 
CAACCCTGCTATCITCATGTGGTCAATTGGTAATGAAATAGGTGAAGCrAATGGTGATGCCCACTCm 
GTTAAACGTTTGGTTAAGGTTATCAAGGATGTTGATAAGACrCGCTATGTTACCATGGGAGCAGATAAArrCCGTT 
TCGGTAATGGTAGCGGAGGGCATGAGAAAATTGCTGATGAACTCGATGCTGTTGGATITAACTATTCTGAAGATA 

40 ATTACAAAGCCCTTAGAGCTAAGCATCCAAAATGGTTGATTTATGGATCAGAAACATCTTCAGCTACCCGTACACG 
TGGAAGTTACTATCGCCCTGAACGTGAATTGAAACATAGCAATGGACCTGAGCGTAATTATGAACAGTCAGATTA 
TGGAAATGATCGTGTGGGTTGGGGGAAAACAGCAACCGCTTCATGGACT 

TGGACAGTTTATCrGGACAGGTACGGACTATATTGGTGAACCrACACCATGGCACAACCAAAATCAAACTCCTGTT 
AAGAGCTCTTACTTTGGTATCGTAGATACAGCCGGCATTCCAAAACATGACITCTATCTCTACCAAAG 

45 TTTCTGTTAAGAAGAAACCGATGGTACACCTTCT^^ 

TAGCTGACTCAGAAGGTAAGATTCCAGTrCGTGCITATTCGAATGCrrCTAGTGTAGAATTGTTCI^ 
ATCTCTTGGTCnTAAGACITrCAATAAAAAACAAACCAGCGATGGGCGGACITACCAAGAAGGTGCAAATGCT 
TGAACirTATCTTGAATGGAAAGTTGCCTATCAACCAGGTACCTTGGAAGCAATTGCTCGTGATGAATCT 
GAAATTGCTCGAGATAAGATTACGACrGCTGGTAAGCCAGCGGCAGTTCGTCTTATTAAGGAAGACCATGCGATT 

50 GCAGCAGATGGAAAAGACITGACTTACATCTACTATGAAATTGTTGACAGCCAGGGGAATGTGGTTCCAACTGCT 
AATAATCTGGTTCGCTTCCAATTGCATGGCCAAGGTCAACTGGTCGGTGTAGATAACGGAGAACAAGCCAGCCGT 
GAACGCTATAAGGCGCAAGCAGATGGTTCTTGGATTCGTAAAGCATTTAATGGTAAAGGTGTTGCCATTGTCAAAT 
CAACTGAACAACCAGGGAAATTCACCCTG ACTG CCCACTCTCATCT^ 

TGGTAAGAAAGAAGGACAAGAGAAGACTGTTTTGGGGACAGAAGTGCCAAAAGTACAGACCATTATTGGAGAGG 
55 CACCTGAAATGCCTACCACTGTTCCGTTTGTATACAGTGATGGTAGCCGTGCAGAACGTCCTGTAACCTGCTCTTC 
AGTAGATGTGAGCAAGCCTGGTATTGTAACGGTGAAAGGTATGGCTGACGGACGAGAAGTAGAAGCTCGTGTAGA 
AGTGATTGCTCTTAAATCAGAGCTACCAGTTGTGAAACGT^ 

TCTGTTTCCTATGTTTTGATTGATGGAAGTGTTGAAGAGTATGAAGTGGACAAGTGGGAGATTGCCGAAGAAGATA 
AAGCTAAGTTAGCAATTCCAGGTTCTCGTATTCAAGCGACCGGTTATTTAGAAGGTCAACCAATTCATGCAACCCT 
60 TGTGGTAGAAGAAGGCAATCCTGCGGCACCTGCAGTACCAACTGTAACGGTTGGTGGTGAGGCAGTAACAGGTCT 
TACTAGTCAAAAACCAATGCAATACCGCACTCTTGCnTATGGAGCTAAGTTGCCAGAAGTCACAGCAAGTGCTAA 
AAATGCAGCTGTTACAGTTCTTCAAGCAAGCGCAGCAAACGGCATGCGTGCGAGCATCITTATTCAGCCTAAAGA 
TGGTGGCCCTCTTCAAACCTATGCAATTCAATTCCTTC 

GAAAAAGCrGACAGTCTCAAAGAAGACCAAACTGTCAAATTGTCGGTTCGAGCTCACTATCAAGATGGAACGCAA 
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GCTGTATTACCAGCTGATAAAGTAACCTTCTCTACAAGTGGTGAAGGGGA 

GAGTTGCATAAGCCAGGAGCAGTCACTCTGAACGCTGAATATGAGGGAGCTAAAGACCAAGTTGAACTCACTATC 
CAAGCCAATACTGAGAAGAAGATTGCGCAATCCATCCGTCCTGTAAATGTAGTGACAGATTTGCATCAGGAACCA 
AGTCTTCC^GCAACAGTAACAGTTGAGTATGACAAAGGTTTCCCTAAAACTCATAAAGTCAC^ 
5 CGAAAGAAAAACTAGACTCCTATCAAACATTTGAAGTACTAGGTAAAGTTGAAGGAATTGACCTTGAAGCGCGTG 
CAAAAGTCTCTGTAGAAGGTATCGTTTC^GTTGAAGAAGTCAGTGTGAC 

TACCAGAAAGTGTTCGGACATATGATTCAAATGGTCACGTTTCATCAGCTAAGGTTGCATGGGATGCGATTCGTCC 
AGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAATGGTCGCTTAGAAGGTACGCAATTAACAACTAAACTTC^ 
TGTTCGCGTATCTGCTCAAACTGAGCAAGGTGCAAACATTTCTGACCAATGGACCGGTTCAGAATTGCCACTTGCC 

10 TTTGCTTCAGACTCAAATCCAAGCGACCCAGTTTCAAATGTTAATGACAAGCTCA TTTCC TACAATAACCAAC 

CCAATCGTTGGACAAACTGGAATCGTACTAATCCAGAAGCTTCAGTCGGTGTTCTGTTTGGAGATTCAGGTATCTT 
GAGCAAACGCTCCGTTGATAATCTAAGTGTCGGATTCCATGAAGACCATGGAGTTGGTGTACCGAAGTCTTATGTG 
ATTGAGTATTATGTTGGTAAGACTGTCCCAACAGCTCCTAAAAACCCTAGTTTTGTTGGTAATGA 
TTAATGATTCTGCCAACTGGAAACCAGTTACTAATCTAAAAGCCCCTGCTCAACrCAAGGCTGGAGAAATGAACC 

15 ACirTAGCTTTGATAAAGTTGAAACCTATGCTGTTCGTATTCGCATGGTTAAAGCAGATAACAA 

TATCACAGAGGTACAAATCTTTGCGAAACAAGTTGCGGCAGCCAAGCAAGGACAAACAAGAATCCAAGTTGACGG 
C AAAG ACTT AGC AAACTTC AACCCTG ATTTG AC AG ACT ACT ACCTTG AGTCTGT AG ATGG AA AAGTTCCG GCAGTC 
ACAGCAAGTGTTAGCAACAATGGTCTCGCTACCGTCGTTCCAAGCGTTCGTGAAGGTGAGCCAGT TCGTG TCATCG 
CGAAAGCTGAAAATGGCGACATCrTAGGAGAATACCGTCTGCACTTCACTAAGGATAAGAGCTTACTTTCT 

20 AACCAGTTGCTGCGGTTAAACAAGCTCGCTTGCTACAAGTAGGTCAAGCACTTGAATTGCCXjACT 

TTACTTCACAGGTAAAGACGGCTACGAAACAAAAGACCTGACAGTTGAATGGGAAGAAGTTCCAGCGGAAAATCT 
GACAAAAGCAGGTCAATTrACTGTTCGAGGCCGTGTCCTTGGTAGTAACCTTGTTGCTGAGA TCACT GTACGAGTG 
AC AG AC AAACTTG GTG AG A CTCTTTC AG AT AACCCT AACT ATG ATG AAAAC AGT AACC AGG CCTTTG CTTC AGC A 
ACCAATGATATTGACAAAAACTCTCATGACCGCGTTGACTATCTCAATGACGGAGATCATTCAGAAAATCGTCGTT 

25 GGACAAACTGGTCACCAACACCATCTTCTAATCCAGAAGTATCAGCGGGTGTGATTTTCCGTGAAAATGGTAAGA 
TTGTAGAACGGACTGTTACACAAGGAAAAGTTCAGTTCTTTGCAGATAGTGGTACGGATGCACCATCTAAACTCGT 
TirAGAACGCTATGTCGGTCCAGAGTTTGAAGTGCCAACCTACTATTCAAACTACCAAGCCTACGACGCAGACCAT 
CCATTCAACAATCCAGAAAATTGGGAAGCTGTTCCTTATCGTGCGGATAAAGACATTGCAGCTGGTGATGAAATC 
AACGTAACATTTAAAGCTATCAAAGCCAAAGCTATGAGATGGCGTATGGAGCGTAAAGCAGATAAGAGCGGTGTT 

30 GCGATGATTGAGATGACCTTCCnTGCACCAAGTGAATTGCCTCAAGAAAGCACTCAATCAAAGATTCn^ 

GAAAAGAACTTGCTGATTTCGCTGAAAATCGTCAAGACTATCAAATTACCTATAAAGGTCAACGGCCAAAAGTCT 
CAGTTGAAGAAAACAATCAAGTAGCnTCAACrGTGGTAGATAGTGGAGAAGATAGCTTTCCAGTACTTGTTCGC^ 
CGTTTCAGAAAGTGGAAAACAAGTCAAGGAATACCGTATCCACTTGACTAAGGAA 
AAACCAGTTTCTGAGAAGACAGTTGCTGCTGTACAAGAAGATCTTCCAAAAATCGA^ 

35 GCATACAAGACAGTTGAGAAAAAAGATTCAACACTGTATCTAGGTGAAACTCGTGTAGAACAAGAAGGAAAAGTT 
GG AAAAG AA CGT ATCTTT AC AGCG ATT AATCCTG ATGG AAGTAAGG AAG AAAAACTCCGTG AAGTG GT AG AAGTT 
CCGACAGACCGCATCGTCTTGGTTGGAACCAAACCAGTAGCTCAAGAAGCTAAAAAACCACAAGTGTCAGAAAAA 
GCAGATACAAAACCAATTGATTCAAGTGAAGCTAGTCAAACTAATAAAGCCCAGTTACCAAGTACAGGTAGTGCG 
GCAAGCCAAGCAGCAGTAGCAGCAGGTTTAACrCTrCTAGGTTTGAGTGCAGGATTAGTAGTTACTAAAGGTAAA 

40 AAAGAAGACTAG 



4101.5 

ATGGATGCAATCTTTGACCTAATCGGAAAGGTTTTCAATCCCATCTTAGA 
45 TCATirrGACAGTATTGGCTITACTTTTTGGAGTGAAATTCTCCAAAGCGCTTG 
CGCTCTTACAGGTATCGGTGCTATCATCGGTATGCTAAACACTGC^ 
AACACTGGTATCCAATTGAGTATTACOiACGTrGGTTGGGCACCAOT 

CACTATACTTCnTGCTCATCATGTTGATTGTCAACATAGTGATGCTAGCTATGAAGAAAACAGATACACTTGATGT 
CGATATCTTTGATATCTGGCACnTGTCTATCACAGGTCrcrrTGATTAAATGGTATGCTGATAACAATGGTGTGAGT 
50 CAAGGGGTTTCACTCTTTATTGCTACAGCAGCTATCGTCCTTGTCGGTGTGTTGAAAATTATCAACT 

GAAACCTACATTTGATGACCTTCTTAACGCCCCAAGTTCATCACCAATGACATCAACTCACATG AACTA CATGATG 

AACCCAGTTATCATGGTTTTGGATAAGATTTTTGAAAAATTCTTCCCAGGCCTrGATAAATATC 

CTAAATTGAACAAGAAAATCGGTTTCrGGGGATCTAAATTCTTCATCGGTTTCATCOT 

ATGGGAACTCCACATCCAATTGCAGGTGTTGCAGATGCAGATAAATGGCGTCTTGTTATCAAAGGATGGTTGTCTC 
55 TTGGTTTGACTGCCGGTGTATCITTGGAACrCTTCTCACTTATCGGTTCATGGTTCA^ 

TCAC A AGGT ATTAC A AACGTTG CT A CT AAACGTCTTC AAGG A CGTAAATTC AATATCG GTCTTG ACTG GCC ATTC A 
TCGCTGGTCGTGCrGAAATCTGGGCrrGTGCCAACGTACTTGCACCAA 

AAAAGTTGGAAATGGTATCTTGCCACTTGCAGGTATCATCGCTATGGGTGTTACTCCAGCTCrCTTGGTTGTAACT 
CGTGGTAAATTGCTCCGTATGATTATCTTCGGAACACT^ 
60 ACCATTTGCAACAGAACTTGCTAAAGGTGTAGGTGCCTTCCCAGAAGGTGTGAGCCAAACTCAATTGATTACTCAC 
TCTACTCrTGAAGGACCAATCGAAAAACTTCITGGTTGGACAATTGGTAACACT 
TCCTTGGTGCAGTAGTCTTCCITGTATTCTATATCGGTATCTTTGCITGG 
CGAAGAGTACGCAGCAAAAGCAAAATAA 

65 4102.1 
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ATGAAGATTATGAAAAAAAAATATTGGACTTTAGCGATATTATTCTTTTGTTTGTT 
AGAAATACCTAAAAATCTTGATGGCAATATAACTCACACTCAGACTAGCGAAAGTTTTTCT 
ACAGGTTGACTATTCTAATAAAAATCAAGAAGAAGTAGACCAAAATAAATTTCGTATTCAAATCGATAAGACAGA 
ATTATTTGTAACAACAGATAAACATTTAGAAAAAAACTGTTGTAAATTGGAAOTGAACCAC^ 
5 TATTGTTAACTCTGAAAGTAATAATTTACTAGGCGAAGATAATTTAGATAATAAAATTAAGGAAAA 

CTAGATAATAGAGGAGGAAATATAGAGCATGACAAAGATAACTTAGAATCGTCGATTGTAAGAAAATATGAATGG 
GATA TAGATAAAGTTACTGGTGGAGGCGAAAGTTATAAATTATATTCTAAAAGTAATTCTAAAGTTTCAAT^ 
TTTTAGATTCAGGAGTCGATTTACAAAATACTGGATTACTGAAAAATCTTTCAAATCACTCAAAAj^ 
CAATAAAGGATATTTAGGAAAAGAGGAGGGAGAGGAAGGAATAATATCAGATATTCAAGATAGATTAGGTCATG 

10 GTACGGCTGTTGTAGCTCAAATTGTAGGGGATGACAATATTAATGGAGTAAATCCTCACGTTAATATTAACGTCTA 
TAGAATATTTGGTAAGTCGTCAGCTAGTCCAGATTGGATTGTAAAAGCAATTTTTGATGCTGTAGATGATGGCAAT 
G ATATTA TCAATCTTAGTACTGGACAATATTTAATGATTGATGGAGAATATGAGGACGGAACAAATGATTTTGAAA 
CArirriGAAGTATAAAAAGGCTATTGATTACGCGAATCAAAAAGGAGTAATTATAGTAGCTGCATTAGGGAATG 
ACTCCCTAAATGTATCAAATCAGTCAGATTTATTGAAACTTATTAGTTCACGCAAAAAAGTAAGAAAACCAGG 

1 5 AGTAGTTGATGTT CCAA GTTATTTCTCATCTACAATTTCGGTCGGAGGCATAGATCGCTTAG 

GTAATTTATCAGATTTTAGCAATAAAGGGGATTCTGATGCAATATATGCGCCTGCAGGCTCAACATTATCT 

AGAATTAGGACTTAATAACTTTATTAATGCAGAAAAATATAAAGAAGATTGGATTTTT^ 

ATATACCrTATCTTTATGGAAACTCATTTGCTGCTCCT 

AATTAAAAGATCAGCCCTATAATTATATGTTTGTAAAAAAATTCTGGAAGAAACATTACCAGTAA 
4106.1 

ATGAAGAAAACATGGAAAGTGTTTTTAACGCTTGTAACAGCTCIT^ 

GAACTGCTTCTAAAGACAACAAAGAGGCAGAACTTAAGAAGGTTGACTTTATCCTAGACTG 
ACCACACAGGGCTITATGTTGCCAAGGAAAAAGGTTATTTCAAAGAAGCTGGAGTGGATGTTGATTTGA 

25 CACCAGAAGAAAGTrCTTCTGACTTGGTTATCAACGGAAAGGCACCATTTGCAGTGTATTTCCAAGACT 

TAAGAAATTGGAAAAAGGAGCAGGAATCACTGCCGTTGCAGCTATTGTTGAACACAATACATCAGGAATCATCTC 
TCGTAAATCTGATAATGTAAGCAGTCCAAAAGACTTGGTTGGTAAGAAATATGGGACATGGAATGACCCAACTGA 
ACTTGCTATGTTGAAAACCTTGGTAGAATCTCAAGGTGGAGACTTTGAGAAGGTTGAAAAAGTACCAAA 
CTCAAACTCAATCACACCGATTGCCAATGGCGTCTTTGATACT^ 

30 GCTAAATCTCAAGGTGTAGATGCTAACirCATGTACTTGAAAGACTATGTCAAGGAGTTTGACTACTA 

TTATCATCGCAAACAACGACTATCTGAAAGATAACAAAGAAGAAGCTCGCAAAGTCATCCAAGCCATCAAAAAAG 
GCTACCA ATAT GCCATGGAACATCCAGAAGAAGCTGCAGATATTCTCATCAAGAATGCACCTjAACTCAAGGAAA 
AACGTGACTTTGTCATCGAAT CTCAA AAATACTTGTCAAAAGAATACGCAAGCGACAAGGAAAAATGGGGTCAAT 
TTG ACG CAG CTCG CTGG AATG CTTTCTACAAATGGG AT AAAG AAAATG GTATCCTT AAAG AAG ACTTG AC AG AC A 

J 5 AAGG CTTC ACCAA CG AATTTGTG AAATAA 

4106.4 

ATG ATAAAAA ATCCTAAATTATTAACCAAGTCr I" ITIT AAGAAGTTTTGCAATTCT AGGTGGTGTTGGTCTAGTCA 
TTCATATAGCTATTTATTTGACCnTTCCnTTTTATTATATTCAACTGGAG^ 
40 AGTGTTTACGGAGTATTTAAAGACTAAGACATCTGATGAAATTCCAAGCITACTCCAGTC^ 
ACCATATCnX3CTCACCTTAAAAGAGATATTGTAGATAAGCGGCTC 

G AAAG CT ATC AAATT ATATCGTG ATGTTAG AT ATGTCTGTT AGT ACAGC AG ATG GT AAAC AGGTAACCGTG C AATT 

TGTT CACGGGGTGGATGTCTA CAA AGAAG CAAAGAATATTTTGCTTITGTATCT 

ATTGlin 1 1 CCTTTGnTlTlCn ATTTTTATA CTAA ^ 

45 AGTAAAATGCAAGATTTGGATGACAATATTCGTTTTGATGAAAGTAGGAAAGATGAAGTTGGTC 

CAGATTAATGGTATGTATG AGCA CnTGTTGAAGGTTATTTATGAGTTGGAAAGTCGTAATGAGCAAATTGTAAA^ 
TGCAAAATCAAAAGGTTTCCTTTGTCCGCGGAGCATCACATGAGTTGAAAACCCCTTTAGCCAGTCTTAGAATTA^ 
CCTAGAGAATATGCAGCATAATATTGGAGATTACAAAGATCATCCAAAATATATTGCAAAGAGTATAAATAAGAT 
TGACC^GATGAGCCA CTTATT AGAAGAAGTACTGGAGTCTTCTAAATTCCAAGAGTGGACAGAGTGTCGTGAGAC 

50 CTTGACTGTTAAGCCAGTTTTAGTAGATATTTTATCACGTTATCAAGAATTAGCTCATTCA^ 

AAAATCAATTGACAGATGCTACCAGGGTCGTCATGAGTCTTAGGGCATTGGATAAGGTTTTGACAAACCTGATTA 

GTAATGCAATTAAATATTCAG ATA AAAATGGG CGTGTAATCATATCCG AG CAAG ATGG CTATCTCTCTATCAA AA 

ATACATGTGCGCCTCrAAGTGACCAAGAACTAGAACATTTATTTGATATATrCTATCATTCTCAAA 

TAAGG ATGAAAGTTCCGGTTTGGGTCTTTACATTGTG AATAATATTTTAG AAAGCTATCAAATGG ATTATAGTTTT 

55 CTCCCTTATGAACACGGTATGGAATTTAAGATTAGCTTGTAG 

4106.6 

ATGTATTTAGGAGATTTGATGGAGAAAGCCGAGTGTGGTCAATTTTCAATACITrCCI^ 

AGACGACCG TCAA GGCTGTAATGGAAGAAACAGGATTTTCAAAAGCAACCCTAACCAAATATGTCACCCTGCTCA 

OO ATGACAAGGCTTTGGATAGTGGCITAGAGCTGGCTATTCACT 

AGCTACCAAGGGGAGAGATATTCGGAGCnTGTrTTTGGAGAGTGCTGTTAAATACCAGATTTTGGTT^ 
TACCACCAACAGTTT TTAG CCCATCAGCTGGCrCAAGAATTGGTGATTAGCGAGGCTACGCTTGGTCGTCACT 
CTGGTTTAAATCAGATTT TGTCAG AATTTGATTrATCCATCCAAAATGGCCGTTGGCGAGGTCCAGAGCATCAGAT 
TCACTATTTCTATTTCTGTCH 1 1 CCGAAAGGTCTGGTCGAGTCAGGAATGGGAAGGTCACATGCAGAAACCAGAG 

05 AGAAAACAGGAGATTGCCAATTrAGAGGAAATCTGCGGTGCAAGrTTGTCTGCGGGGCAGAAATTGGAClTGG^ 
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CTCTGGGCTCACATCAGTCAACAACGTCnTCGGGTCAATGCTTGTCAGTTTCAAGTCATAGAAGAGAAAATGCGA 
GGGTATTTTGACAATATCnTTATCTTCGTTTGCTGAGAAAGGTTCCGTCCTTTTT^ 
AGGACTITGAGGATGGTGAGATGATGATATTCTTCTCTrrTCTCCTATCTCATCGCAT^ 
AGTATATTCITGGTTTTGGAGGGCAGTTGGCAGATTTACTGACGCA 
5 TATTGGGGGATTATACAGAGGACCATGTCACCTATGAACTCAGTCAGCnTrcTGCTCAAGTCTATCT 

CTATATTTTACAGGATCGCTACAAGTACCAGTTAGAGAATCGTCATCCATATTTACTGATGGAACATGATTTTAAA 
GAGACAGCAGAGGAGATTTTTCATGCTCTACCTGCTTTTCAACAGGGGACAGATTTAGATAAGAAGATTCTCTG^ 
AATGGCTCCAGTTAATCGAATATATGGCTGAAAACGGTGGCCAGCATATGCGGATTGGTCTGGATTTGACATCTGG 
TTTTCTTGTOTTTCAAGGATGGCAGCCATTTO 
10 ATGACCCTAGTCGGCATTATGATTTGCTGGTTACCAATAACCCGATTCATAAGAAGGAACAGACACCAGTCTATTA 
TTTAAAAAATGACTTGGATATGGAGGATTTGGTAGCGATTCGCCAGTTATTATTCACTTAA 

4106.7 

ATGGAATTTTCAAAGAAAACACGTGAATTGTCAATTAAAAAAATGCAGGAACGTACCCTGGACCTCTTGATTATC 

15 GGTGGAGGAATCACAGGAGCTGGTGTAGCCTTGCAGGCXiGCAGCTAGCGGTCTTGAGACTGGTTTGATTGAAATG 
CAAGACTTTGCAGAAGGAACATCTAGTCGTTCAACAAAATTGGTTCACGGAGGACTTCGTTACCTCAAA 
ACGTAGAAGTGGTCTCAGATACGGTTTCTGAACGTGCAGTGGTTCAACAAATCGCTCCACACATTCCAAAATCAG 
ATCCAATGCTCTTACCAGTTTACGATGAAGATGGAGCAACCTTTAGCCTCTrCCGTCTTAAAGTAGCCATGG 
GTACGACCTCTTGGCAGGTGTTAGCAACACACCAGCTGCGAACAAGGTTTTGAGCAAGGATCAAGTCTTGGAACG 

20 CCAGCCAAACTTGAAGAAGGAAGGCTTGGTAGGAGGTGGAGTGTATCITGACTTCCGTAACAACGATGCGCGTCT 
CGTGATTGAAAACATCAAACGTGCCAACCAAGACGGTGCCCTCATTGCCAACCACGTGAAGGCAGAAGGCITCCT 
CTTTGACGAAAGTGGCAAGATTACAGGTGTTGTAGCTCGTGATCTCTTGACAGACCAAGTGTTTGAAATCAAGGCC 
CGTCTGGTTATTAATACAACAGGTCCTTGGAGTGATAAAGTACGTAATTTGTCTAATAAGGGAACGCAATTCTCAC 
AAATGCGCCCAACTAAGGGAGTTCACTTGGTAGTAGATTCAAGCAAAATCAAGGTTT CACAG CCAGTT^ 

25 ACACAGGTTTGGGTGACGGTCGTATGGTCTTTGTTCTCCCACGTGAAAACAAGACTTACTTTGGTACAACTGATAC 
AGACTACACAGGTGATTTGGAGCATCCAAAAGTAACTCAAGAAGATGTAGATTATCTACTTGGCATTGTCAACAA 

CCGCTTCCCAGAATCCAACATCACCATTGATGATATCGAAAGCAGCTGGGCAGGTCTTCG 

TCCATTGATTGCAGGGAACAGTGCCTCTGACTATAATGGTGGAAATAACGGTACCATCAGTGATGAAAGCTTTGA 
CAACTTGATTGCGACTGTTGAATCTTATCrCTCCAAAGAAAAAACACGTGAAGATGTTGAGTCT 

30 CTTGAAAGTAGCACATCTGAGAAACATTTGGATCCATCTGCAGTTTCTCGTGGGTCTAGCITGGACCGTGATGACA 
ATGGTCTCTTGACTCTTGCTGGTGGTAAAATCACAGACTACCGTAAGATGGCTGAAGGAGCTATGGAGC^ 
TTGACATCCTCAAAGCAGAATTTGACCGTAGCTTTAAATTGATCAATTCTAAAACTTACCCr 
ATTGAACCCAGCAAATGTGGATTCAGAAATCGAAGCCTITGCGCAACTTGGAGTATCACGTGGTTTGGATAGCAA 
GGAAGCTCACTATCTGGCAAATCITTACGGTTCAAATGCACCGAAAGTCTTTGCACTTGCT 

35 GCGCCAGGACTCAGCTTGGCAGATACTTTGTCCCTTCACT^ 

ACTTCCTTCTTCGTCXrTACCAATCACATGCrCTTTATGCGTGATAGCTTGGATAGTA 

GAAATGGGACGATTCTATGACTGGACAGAAGAAGAAAAAGCAACTTACCGTGCTGATGTCGAAGCAGCTCTCGCr 
AACAACGATTTAGCAGAATTAAAAAATTAA 

40 4106.8 

ATGATGAATGAATTATTTGGAGAATTTCTAGGGACTTTAATCCTGATTCTTCTAGGAAA 

TGGTTCTTCCTAAAACCAAGAGCAATAGCTCAGGTTGGATTGTGATTACTATGGGTTGGGGGATTGCAGTTGCGGT 
TGCAGTCTTTGTATCTGGCAAGCTCAGTCCAGCTTATTTAAACCCAGCTCTGACCATCGGTGTGGCC TrAA AAGGT 
GGTTTGCCTTGGGCTTCCGTTTTGCCTTATATCTTAGC^ 
45 GTTGCAATTCAAACCTCACTATGAGGCAGAAGAAAATGCAGGCAATATCCTGGCAACCTTCAGTA CTGGACCAG C 
CATCAAGGATACTGTATCAAACTTGATTAGCGAAATCOT 

GTCTTTACGACTTTCAGGCAGGTATCGGAACCnTrGCAGTGGGAACTTTGATTGTCGGTATCGGTCTATCACT 
TGGGACAACAGGTTATGCCTTGAACCCAGCTCGTGACCnTGGACCTCGTATCATGCACAGCATCTTGCCAATTCCA 
AACAAGGGAGACGGAGACTGGTCTTACGCTTGGATTCCTGTTGTAGGCCCTGTTATCGGAGCAGCCTTGGCAGTG 
50 CTTGTATTCTCACTTTTCTAG 

4106.10 

ATGAAAAGGACCTGGAGGAACTCATTCGTGACAAATCTTAATACACCTTTTATGATTGGCAATATTGAGATTCCCA 
ATCGTACCGTTTTAGCGCCTATGGCTGGCGTGACCAACTCAGCCTTTCGTACTATCGCAAAGGAGCTCGGAGCTGG 

55 ACTCGTTGTAATGGAAATGGTCTCTGACAAGGGAATCCAATACAACAACGAAAAAACCCTGCACATGCTTCATAT 
CGATGAGGGCGAAAACCCTGTCTCTATCCAACTTTTTGGTAGCGATGAAGACAGCCTAGCACGCGCAGCAGAATT 
CATCCAAGAAAACACCAAGACCGATATCGTCGATATCAACATGGGCTGCCCTGTCAACAAAATCGTGAAGAACGA 
AGCTGGTGCTATGTGGCTCAAGGATCCAGACAAGATTTACTCCATCATCAACAAGGTCCAGTCTGTCCTTGATATC 
CCACTTACTGTCAAAATGCGTACCGGCTGGGCGGACCCATCTCTTGCAGTAGAAAATGCTCTCGCTGCTGAAGCTG 

60 CAGGTGTTTCTGCCCTCGCCATGCATGGCCGTACCCGTGAACAAATGTATACTGGCCACGCAGACCTTGAGACCCT 
TTACAAGGTTGCCCAAGCTCTAACCAAGATTCCATTCATCGCCAACGGTGATATCCGTACTGTCCAAGAAGCCAA 
GCAACGCATCGAAGAAGTTGGTGCTGACGCAGTCATGATTGGCCGAGCTGCCATGGGAAATCCTTACCTCTTCAA 
CCAAATCAACCATTACTrrGAAACAGGAGAAATCCTACCTGATTTGACCTTTGAAGAC/^ 

CGAACACTTGAAACGATTGATTAACCTCAAAGGAGAAAACGTCGCAGTTCGTGAATTCCGCGGTCTCGCTCCTCA 
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CTATCTCCGTGGAACATCTGGCGCTGCCAAACTCCGTGGAGCCATTTCGCAAGCCAGCACCCTGGCAGAGATTGA 
AACCCTCTTGCAATTGGAGAAGGCTTAA 




4107.1 

ATGACAAAGAAGAAAATTGAGCGTATTTCTGTAATACACCGAGAAAAGATTTTATGGCTCAAGTGGTATTTCATGC 
GAGATAAAGAACAACCTAAGTATAGTGTCCTTGAGCGTAAAATGTTTGATGCTGCTAAAAATCAAG ATATG CTAG 
10 CTTATCAAAAATACGCAACTATCAAGCAGATAACAGATATTAGGGTACAAACAAGTGAGGCTGACATTTTAGAGG 
CTGTAAAAGAGGTTTATGTGTACAATCACATGAATGTTATCGGAGCTTGT CAGCG GATATTATTTATCAGTCAATC 
ACCAGCTTATGATAAGTTAAATAAGTGGTTTAATATCTATTCTGATTTGTATTTTAGCGTT 
TGGGGGTATATCATGAGATGGTAGGTATCTAG 

15 4107.2 

ATGAAAAATTCCAACGAGGCTGAGATGAAATTACTTTATACTGATATTCGGACTTCm 
GAGAGGCAGAAGAGCTAGTTGCAGCTGGCAAGCGGGTr^CTACATTGCCCCCAACTCT 
ACGCGCCGTGCTGGAATACTTGTCCCAGCAGGCTTCTTTTTC^ 
TATCTGGTCTTGAATGATTTACCAGCTAAAACTACTCTTGATGAT^ 
20 TGCCGAACTCGATCCCAAGGACTTGCGTGTTTATGGCGCTATTAAGCAGGATCCTCAATTGATCCAGCAGTTAATT 
GAGCTTTACCATGAGATGACCAAATCTCAGATGAGTTTTTTGGACTTGGAGAAm 
GCGGATTTACTCTTGATTTTTGAGAAAGTAACAGCCTATCTTAATCAAGGTCAGTTA 

CCCATTTGATTGAGGCTATTGAGAATGACAAGGTAAGTAGTGATTITAATCAAATCGCCTTGGTCATTGACGG 
TACTCGTTTTTCTGCTGAGGAAGAGCGGGTTGTGGACTTACrrCACGGCAAAGGTGTTGAGATTGTTAT^ 
25 TATGCTAGTAAGAAAGCCTATACCAGTCCTTTTAGCGAGGGCAATCTCTACCAAGCCAGCGTA AAATT TCTCCATC 
ATCTGGCTrCTAAATACCAAACGCCTGCTCAGGACTGTTCTCAAACTCATGAGAAGATGGATAGTTTTGACAAG^ 
CTCTCGTTTGTTGGAGTCTTCTTATGACTTTTCAGAACT 

CAAATCTGGTCTTGTTTGACGCAAAAGGAGGAGTTGGAGCTAGTAGCCCGTAGTATTCGTCAGAAATTACATGAG 
AACTCAGACCTGAGCTACAAGCATTTTCGTATTCTCTTGGGGGATGTAGCTTCTTACCAGTTATCT 
30 TTTTTGACCAGTATCAGATTCCTTTTTATCTTGGTAGAAGCGAAGCCATGGCTCATCATCCCnTGACT 
GAGTCTATTTTAGCTTTAAAACGTTACCGTTTTCGTCAGGAGGATTTGATTAATCTTCTT 

TG ACCTC AGTC AGTCTG AT ATTG ATG CTTTTG AG C AAT AT ATCCG CT ATCTTGGT ATCAATG GCTTGCC AG C CT TTC 
AGCAAACCrrCACCAAATCCCACCATGGAAAATTTAATCTTGAGCGTTTGAATCTCCTCCGCCT^ 
ACCTCTTGAAACCCTCTTTGCCAGCCGAAAACAAAAGGCTGAAAAACT^ 
35 AGAAGGAGCTGTGACCAAGCAGTTACAAGATTTGACAACCACTTTGGAAGCTGTAGAACAGGAAAGACAAGCCG 
AAGTTTGGAAGGCTrrCTGCCATGTTTTAGAACAATTrcCGACTG 

CTTCCTAGCCTTGCTCCATTCTGGAATGAGTTTGTCCCAATACCGTACCATTCCAGCAACAGTGGACACTGTTCT 

GTGCAGAGTTACGATTTGATTGCACCATTGACTGCTGACTTTGTCTATGCTATTGGACTAACTCAGGACAATTTAC 

CAAAAATTTCTCAAAACACCAGTCTTCTGACAGATGAAGAAAGGCAAAACCTAAACCAAGCGACCGAAGAAGGC 

40 GTTCAATTACTGATTGCCAGCAGTGAAAATCTCAAGAAAAATCGCTACACTATGCmC 

GTAAGCAGTTGTTCTTGTCGGCTCCAAGCCTITTTAACGAAAGTGAAAGTAAGGAATCTGCCTATC^ 
GATCCATTTTGGATTTAGGCGGAGAGAGAAGAGGATGAATCACAAAGGACTGTCTAAGGAGGATATGGGG TCCTA 
TCACAGTCTTTTGTCTAGTCTGGTTGCCTATCACCAGCAGGGTGAGATGAGCGATACTGAGCAAGATTTGACTTTT 
GTCAAGGTTCTGTCGCGTGTCATAGGTAAAAAACTAGATCAGCAAGGTCTGGAAAATCCAGCTATCCCAACCAGT 

45 CCAAGCAGCAAGACCTTAGCCAAGGACACCTTGCAAGCTCT 

CTATCCTGCCAAACAGGAGTTTTACCTGTCTACGTCGGGTTTGACAGAGTTTTATCGCAATGAATACAGT^ 
CTACGCTACGTTTTAGGCITGCAGGAGGAATTACGTTTGCATCCTGATGCCCGTAGTCACGGGAATTrCT^ 
GTATCTTTGAACGCGCCTTACAGTTGCCTAATGAAGATTCCnTrGACCAACGTCTAGAAC 
CAGTCAAGAACGCGAATTTGAAGCTATTTATCAAGAAAGrTO 

50 GTTGCACGGACAACTGGACATATTCTCCGACACAATCCAGCCATCGAAACCATCAAAGAAGAAGCAAATTTTGGT 
GGAAAAGACCAAGCCTITATrCAATTAGACAATGGACGCAGTGTCTTTGTACGAGGCAAGGTGGACCGGATTGAC 
CGTTTGAAAGCTAATGGAGCGATAGGAGTAGTAGACrACAAATCCAGTCTGACTCAGTTCCAGTTrCCTCATTTCT 
TTAATGGGCTCAATTCTCAGTTACCAACCTATCTTGCTGCCCTAAAAAGAGAAGG 

CATGTACTTGGAAATGGCTGAACCTGTCCAATCTCTGATGGCGGTAAAAAGTCTGGCAGGAGCAGTGGTAGAAGC 
55 CAGCAAATCTATGAAATACCAAGGGCTCTTCTTGGAAAAAGAAAGCAGTTATTTAGGCGAArm 

CAAGGCTAATCAACTGACAGATGAGGAATTTCAGCTCCTACTGGACTACAATGCCTATCTTTACAAGAAAGCT 
GAGAAGATTTTAGCAGGCCGGTTCGCCATCAATCCTTATACTGAAAATGGCAGAAGCATTGCCCCATACGTCCAG 
CAACATCAGGCTATTACAGGCTTrGAAGCCAATTACCATCTGGGCCAAGCCCGTTTCCTAGAAAAGTTGGACCT 
CTG ATG G C AA GCGTCTG GTCG G AG AAAAACTC AAG C AAGCTTG G CTTG AAAAAAT AA 
GAGAGGAGTTGAATCGATGA

4107.3 

ATGAAGCTTATTCCCTTTTTAAGTGAGGAGGAGATrCAAAAACTGCAAGAAGCAGAAGCAAATTCGAGCAAGGAA 
CAGAAGAAAACTGCCGAGCAAATCGAAGCTATCTACACTTCTGCCCAGAATATCCTGGTCTCAGCATCGGCTGGT 
TCTGGAAAGACCTTTGTCATGGCAGAGCGCATTCTGGACCAATTGGCGCGTGGTGTC^
TCTCAACCTTTACCCTCAAGGCTGCAACTGAACTTAAA

AAACAGATGATGTCGACCTCAAACAACACTTGGGTCGCCAGTTGGCAGACCTACCCAATOCTGCCAT TGGAA CCA 

TGGATTCTTTCACACAAAAATTCCTTGGCAAACATGGTTATCTGCTTGATATTGCACCT 

AACCAAAGCGAGCAACTTATTCTCGAAAACGAAGTCTTTCATGAGGTCTTTGAAGCGCA 

AAAGAGACCTTTAGTCATTTGCTGAAAAACTTTGCTGGGCGTGGCAAGGACGAACGG 

TATAAAATCTATGACTTCCTCCAATCCACCAGTAATCCTCAAAAGTGGCTGAGTGAATCTITCCT 

AGAAAGCTGATTTTACCAGTGAAAAAGAAAAACTGACCGAGCAAATCAAACAAGCCCTI^ 

TTTTCCGTTACCATCTGGATAACGATGCCAAGGAGTTTGCAAAGGCTGCCTATTTAGAAAATGTTCAGTTAAl^ 

GGATGAAATTGGCTCCCTAAATCAGGAGTCCGATAGTCAGGCTTATCAGGCAGTGCTTGCGCGTGTTGTCGCCATC 

TCTAAGGAGAAAAACGGTCGAGCTCTGACTAATGCCAGCCGTAAGGCTGATTTGAAGCCCCTGGCTGATGCCTAC 

AACGAAGAGAGAAAGACCCAGTTTGCTAAACTAGGACAATTATCAGACCAGATAGCGAT 

TCTCGACTATCAAGAACGTTATCATGGAGACACTTGGAAACTAGCTAAAACCTTC 

GTAGAGGCTTATCGTCAGAGAAAACGACAGGAAAATGCCTTCGAATTCGCTGATATCAGCCATTACACCATTGAG 

ATTTTAGAGAATTTCCCACAAGTTCGTGAGTCTTATCAGGAGCGCTTCCATGAAGTCATGGTCGATGAGTATCAGG 

ATACCAACCATATTCAAGAACGGATGCTGGAATTGTTGTCTAATGGCCACAATCGCTTTATGGTGGGAGATATCAA 

GCAATCCATCTATCGTTTCCGTCAGGCAGACCCGCAGATTTTCAATGAGAAATTCCAACGCTATGCGCAAAATCCC 

CAAGAAGGCAGGCTCATTATCCTCAAGGAAAATTTCCGTAGTAGTTCAGAAGTGCTGTCAG CAACC AATGATGTC 

TTTGAACGTCTCATGGACCAAGAGGTCGGCGAAATCAACTATGATAA^^ 

AACTGACTCCCAATCCAGACAACAAGGCAGCATTTCTCCTCTACGACAAGGACGATACAGGTGAGGAAGAAGAGA 

GTCAAACAGAAACGAAACTAACAGGCGAAATGCGCTTAGTTATCAAGGAGATTCTGAAACTTCATCAAGAAAAAG 

GTGTTGCCTTTAAGGAAATTGCCCTTCTGACCTCCAGCCGCAGTCGTAATGACCAGATTCTCCTCGCCCTGTCTGA 

GTACGGAATTCCTGTCAAAACTGACGGAGAGCAAAACAATTATCTCCAATCCCTAGAAGTGCAAGTCATGCTAGA 

CACTCTTCGTGTCATTCACAATCCCCTTSCAAGACT^ 

ATGAGGATGAGCTAGCACGTTTGTCCCTTCAGAAAGCAGAGGATAAAGTCCACGAAAATCTCTATGAGAAACTGG 

TCAATGCACAAAAAATGGCAAGTAGTCAAAAAGGCTTGATTCACACAGCTCTAGCTG 

AAAAACTAAAGCAATTCATGGATATCCTAGCTTCTrGGCGCTTGTATGCCAAAACCCACTCTCTCT^ 

TTGGAAGATITACAACGACCGTrmATTATGACTATGTTGGGGCTTTGCCGAATGGTCCT^ 

CTCTATGCCCTAGCACTGCGTGCTGATCAATTTGAAAAGAGCAATTTCAAAGGTTTGTCGCGTTT^ 

TTGACCAAGTCTTAGAAGCCCAGCACGATTTGGCAAGCGTGGCCGTCGCACCGCCAAAAGATGCAGTAGAGCTCA 

TGACCATCCACAAGAGTAAAGGGCTGGAGTTTCCTTACGTCTrrATCCTCAATATGGATCAAGATTTCAACAAGCA 

AGACTCTATGTCAGAAGTCATTCTCAGTCGTCAGAATGGTCTTGGTGTCAAATATATTGCCAAGATGGAGACAGGG 

GCAGTAGAAGACCACrATCCTAAAACCATCAAACTCTCCATTCCTAGTCTGACCTATAGGCAGAACGAAGAGGAA 

TTACAGCTAGCAAGCTATTCTXiAGCAGATGCGTTTGCTGTATGTTGCTATGACGCGGGCTGAGAAAAAGCT 

TTGTCGGCAAGGGTTCTCGTGAAAAGCTGGAATCCAAGGAATACCCAGCAGCCAAAAATGGGAAACTAAATAGCA 

ATACTAGACTGCAAGCACGGAATTTCCAAGATTGGCTTTGGGCTATCAGTAAAGTGTTTACT 

ACTTTAGTTATCGTTTTATTGGCGAAGATCAGTTGACCAGAGAAG 

CCAAGATAGCTCCCAAGCAGACAATCGTCAGTCAGATACCATCAAAGAAGCTCTGGAAATGCTGAAGGAGGTGGA 

AGTTTATAATACTCTTCACCGCGCAGCTATTGAACTTCCTAGTGTTCAAACCCCAAGTCAAATCAAGAAAT^ 

GAACCAGTTATGGATATGGAAGGTGTCGAGATTGCTGGTCAAGGTCAGTCAGTAGGCAAGAAAATCAGCTTCGAT 

TTGCCAGATTTTTCAACCAAAGAAAAGGTAACTGGAGCTGAGATTGGTAGTGCTACTCACGAACTCATGCAGAGA 

ATTGACCTCAGCCAGCAACTAACCCTTGCTAGCCTAACAGAAACACTCAAACAAGTTCAAACTAGCCAAGCT 

AGAGACAAGATCAATCTTGATAAAATTCHTGCTTTCTTTGACACAGTACTCGGTCAGGAAATTOT 

A CC ATCTTT ATCG CG AGC AACCTTTCTCC ATG CTC AAACG AG ACC AAAAG AGTCAG G AAG ACTTTGTTGTCCGTG G 

TATCCTTGATGGCTATCTGCTTTACGAAAACAAAATTGTTCTGTTCGACTACAAGACAGACCGCTATGATGAA^ 

AGTCAACTCGTAG ACCGCTATCGTGGTC AGTTAG CTCTATACG AAG AGGCTTTATCACG AG CCTATTCG ATTG AAA 

ATATTGAAAAATACITGATTTTACTCGGTAAAGACGAGGTTCAAGTTGTAAAAGTATAA 

4109.1 

ATGGAACTTGCrCGCCATGCTGAAAGCTTGGGAGTAGATGCTATTGCAACGATTCCACCAAm 
CAGAATACTCAGTTGCCAAATACTGGAACGATATCAGTTCTGCAGCTCCAAACACAGACTACGTGATTTACAACA 
TTCCTCAATTGGCAGGGGTTGCTTTGACTCCAAGCCTTTACACAGAAATGTTGAAAAA rCCTCGTGTTATCGGTGT 
GAAGAACTCTTCTATGCCAGTTCAAGATATCCAAACCTTTGTCAGCCTTGGTGGAGAAGACCA^ 
GGTCCTGATGAGCAGTTCCTAGGAGGACGCCTCATGGGGGCTAGGGCTGGTATCGGTGGTACTTATGGTGCTATGC 
C AG AACTCTTCTTG AAACTC AATCAGTTG ATTG CGG AT AAG G A CCT AG AAA C AG CGCGTG AATTGC AGT ATG CT AT 
CAACGCAATCATTGGTAAACTCACTTCTGCTCATGGAAATATGTACGGTGTCATCAAAGAAGTCTTGAAAATCAAT 
GAAGGCTTGAATATTGGATCTGTTCGTTCACCATTGACACCAGTGACTGAAGAAGATCGTCCAGTTGTAGAAGCG 
G CTG CTG CCTTG ATTCGTG AAACC AAGG AG CG CTTCCTCT AA 

4110.2 

ATGTATAAGACAAAGTGTTTACGAGAGAAGTTAGTATTATnrTTTAAAAATTTTCTrCCCAATCCT 
TTGCCAATTATTCTGCCTCTTTTGTTGATACTGCAATGACAGGTCAATACAACACTATGGACTTGGCTGGTGTATCT 
ATGGCAACCAGTATCTGGAATCCTTTCTTTACArrTCTAACAGGGATTGTGTCAGCCTTGGTGCCTATCA 
CCATCTTGGTCGAGGCAAAAAGGAAGAAGTTGCGTCTGATTTTTACCAATTTAT^ 

GTGGTCTTGCTGGGGATGGTACTTTTCTTGGCACCAATAATCTTGAATCATATTGGGTTAGAAGCAGCAGTAGCGG 
CAGTAGCGGTTCGCTATCTTTGGTITTTATCTATCGGGATTATCCCCTTGTTGCTCTTTAGCG 
TGGATTCGCTGGGCTTGACCAAACTGTCCATGTACCTCATGCTTTTGTTACTCCCTCTCAATAGCG

CTCnTGATTTACGGTGCCTTTGGTGTTCCAGAACTGGGAGG 

GGGTCTTGCTTGGGATTTCTGTTCTGGTTTTATTTAAACAGGAGAAGCT^ 

TCCACTTAATATGGATAAAATTAAGGAAGGAGTrCGTTTAGGTCTGCCTATTGGGGGAACTGTCITCGCGGAA 
GCTATCTTTTCAGTGGTTGGCTTGATTATGGCTAAGTTTTCGCCCITGATTATAGCTAGTCACCAGTCAGCT
CTTTTCAAGTCTTATGTACGCCTTTCCTATGAGTATCTCATCGGCTATGGCTATTGTCGTTTCCTATC 
CCAAGCGATTTGATGATGCGAAAACCTATATTGGTCTAGGAAGATGGACTGCCCTCATTTTTGCGGCCT^ 
AACCTTCCTTTACATTTTTAGGGGAAATGTGGCCAGTCTTTATGGTAACGACCCAAAATTTA 
CGTTTTITAACrrTATAGTCriTrTCTTCCAGTTAGCAGATACCT 
TAAGGATACAGTTATTCCTTTTTACCITGGTTTGCTTGGTTATTG

4112.2 

ATG AGTACTTTAGCAAAAATAG AAGCG CTCTTGTTTGTAGCGGGTG AAG ATGGG ATTCGGGTCCG CCAGTTAGCT 
GAACTCCTCTCTCTGCCACCGACAGGCATCCAGCAAAGTrTAGGAAAATTAGCCCAGAAGTATGAAAAG GACC CA 
GATTCCAGTTTGGCTITGATTGAGACAAGTGGTGCTTATAGATTGGTGACCAAGCCTC^^

AGGAATACTCTAAGGCGCCTATCAACCAGAGCTTGTCTCGGGCTGCCCTTGAGACCnTGTCCATrATTGCCTAC^ 
ACAGCCGATTACGCGGATAGAAATTGATGCCATCCGTGGAGTTAACTCGAGTGGAGCCTTGGCAAAGTTGCAGGC 
TTTTGACCTGATAAAGGAAGACGGGAAAAAGGAAGTATTGGGGCGCCCCAACCTCT^ 

CCTAGATTACATGGGGATAAACCATTTAGAAGAATTACCAGTGATTGATGAGCTTGAGATTCAAGCCCAAGAAAG 
CCAATTATTTGGTGAAAGGATAGAAGAAGATGAGAATCAATAA

4113.1 

ATGGATACGATGATTAGTAGATTTTTTCGCCATTTATTTGAAGCCTTAAAAAGTTTG 
CAGTAGCTGCTGTCAGTTCAGTCATGATTACTTTGACCTTGGTGGCAATATTTGCATCTGTTATTT^ 
AAACTAGCTACAGATATTGAAAATAATGTCCGTGTAGTAGTTTATATCCGAAAGGATG TGGA AGATAATAGTCAG
ACAATTGAAAAAGAAGGTCAAACTGTTACAAATAATGACTACCACAAGGTATATGATTCTTTGAAGAACATGTCT 
ACGGTTAAAAGTGTTACCTTTTCAAGTAA 

AAAATCTITGAAGGAGATGCCAATCCTCTCTATGATGCCTATATTGTAGAGGCAAACACT 

ACTATAGCCGAAGATGCTAAAAAAATTGAAGGTGTCTCTGAGGTTCAAGATGGCGGTGCCAATAC AGAA AGACTC 
TTCAAGTTAGCTrCATTTATCCGTGTTTGGGGA CI 1 GAT

TTCAAATACCATTCGTATTACCATTATTTCCCGCAGTCGCGAAATTCAAATCATGCGCTTGGTCGGAGC TAAAA AC 
AGTTATATCCGTGGACCGTTCTTGTTAGAAGGAGCCTTTATCGGTTTATTGGGAGCTATCGCACCAT^ 
CTTT ATTGTTT ATCAAATTGTTT ACC AATCTGTC AAC A A ATCGTTG GT AG GGC AAAATCT ATC C ATG ATT AGTCC A 
G ATTT ATTT AGTCCGTTG ATG ATTG CCCT A CT ATTTGTG ATTGGGGTTTTC ATTG GTTCATTGGG ATC AG G AAT ATC 
CATGCGCCGATTCTTGAAGATTTAG

4117.1 

ATGAAGAAAGTAAGATTTATTTTTTTAGCTCTGCTATTTTTCTTAGCTAGTC 

GTACTTGGCAAGGAAAACAGTATCTGAAAGAAGATGGCAGTCAAGCAGCAAATGAGTGGGTTTTTGATACTCATT 
ATCAATCTTGGTTCTATATAAAAGCAGATGCTAACTATGCTGAAAATGAATGGCTAA AGCAA GGTGACGACTAT^
TTACCrCAAATCTGGTGGCTATATGGCCAAATCAGAATGGGTA^ 

GATGGAAAGATGAAAAGAAATGCTTGGGTAGGAACTTCCTATGTTGGTGCAACAGGTGCCAAAGTAATAGAAGAC 

TGGGTCTATGATTCTCAATACGATGCnTGGTTTTATATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTC 

CAAATTAAAGGGAAGGACTATTATTTCAAATCCGGTGGTTATCTACTGACAAGTCAGTGGAT TAAT CAAGCTTATG 

TGAATGCTAGTGGTGCCAAAGTACAGCAAGGTTGGCTTITTGACAAACAATACCAATCTTGGTTT^

AAATGGAAACTATGCTGATAAAGAATGGATTTTCGAGAATGGTCACrATrATTATCTAAAATCCGGTGGCTA 
GCAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATTTGATGGGAAAATGGCT 
TGGGTCTACGATTCTCATAGTCAAGCnTGGTACTACTTCAAATCCGGTGGTTACATGACAGCCAATGAATGGATTT 
GGGATAAGGAATCTTGGTTTTATCTCAAATCTGATGGGAAAATAGCT^ 

GTCAAGCTTGGTACTACnTCAAATCCGGTGGTTACATGACAGCCAATGAATGGATTTGGGATAAGGAATCTTGG^
TTACCTCAAATCTGATGGGAAAATAGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTAC^ 
AAATCTGGTGGCTACATGGCGAAAAATGAGACAGTAGATGGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGA 
GG AAA AACTACAAATGAAAATGCTGCTTACTATCAAGTAGTGCCTGTTACAGCCAATGTrTATGATTCAGATGGTG 
AAAAGCTTTCCTATATATCGCAAGGTAGTGTCGTATGGCTAGATAAGGATAGAAAAAGTGATGACAAGC GCTTG G 

CTATTACTATTTCTGGTTTGTCAGGCTATATGAAAACAGAAGATTTACAAGCGCTAGATGCTAG

CCCTTATTATGAGAGTGATGGCCACCGTTTTTATCACTATGTGGCTCAGAATGCTAGTATCCCAGTAGCTTCTCAT 
CTTTCTGATATGGAAGTAGGCAAGAAATATTATTCGGCAGATGGCCTGCATTTTGATG 
CCTTCCTTTTCAAAGATTTAACAGAGGCTACAAACTACAGTGCTGAAGAATTGGATAAGG 
CATTAACAATAGCCTTTTGGAGAACAAGGGCGCTACTTTTAA^^ 

TATCTCCTTGCCCATAGTGCCCTAGAAAGTAACTGGGGAAGAAGTAAAATTGCCAAAGATAAGAA TAAr i llinG

GCATTACAGCCTATGATACGACCCCTTACCTTTCTGCTAAGACATTTGATGATGTGGATA^ 

AACCAAGTGGATTAAGGAAAATTATATCGATAGGGGAAGAACnTrCCrTGGAAACAAGGCTTCTGGTATGAATGT 
GGAATATGCTTCAGACCCTTATTGGGGCGAAAAAATTGCTAGTGTGATGATGAAAATCAATGAGAAGCTAGGTGG 
CAAAGATTAG 
4119.2

ATGAAAAAAGTATTACAAAAATATTGGGCATGGGCTTTT 

ATGTGCCGATGTTTCAAGGAGCCTTTTACAGTTITACCAACTGGACAGGATTGACTTATAACT^ 
CTTAAACAACTTTAAGCTCCTCTTCATGGATCCAAAATTCATGAATGCGATTGGCm 
GCCATGGTGGTTGGTGAGATTGCACTCGGGATCTTCATTGCGCGTGTCTTGAATTCTAAAATCAAAGGCGAAACCT
TCTTCCGTGCITGGTTCTrCTTCCCAGCT^ 

TACGGTCTTCCAGCGATTGGAAATGCCCTTCATATTGAATTTTTCCAAACCAGTCTTTTAGGGACT 
CAATCTTTGCGGCTGTCTTTGTCCTTCTTTGCCAAGGGGTGGCT 

TCTATTCCAACTGAGATTACAGAGGCAGCAAGGATTGATGGTGCGACTAGCAAGCAAGTTTTCT GGAA CATTGAA 
TTGCCTTACrTGCTACCAAGTGTCTCTATGGTCTTTATCCTAGCCCTAAAAGGTGGGCTGACTG
CTTTGCCATGACCGGTGGTGGTCCAAACAATGCCACAACCTCACTTGGGCrCTTGGTTTATAACT 
AACAACCAATTCGGTTATGCCAATGCCATTGCCGTAATCTTGTTCTTCTTAATTGTAGTGATTTCG 
GAGAGTATCTAAGAAATTTGAAATTTAA 

4119.3

ATGATGAAACAAGATGAAAGAAAAGCCCTGATTGGCAAATACATTCTATTGATTCTAGG ATCGG T TCTGA TTTTAG 
TGCCGCTCCTTGCTACCCTCTTTAGTTCCITCAAACCCACTAAGGATATTGTAGATAATT^ 
AACTTCACATGGGACAACTTTAGCCGTCTCTTAGCTGATGGGATTGGAGGCTATTATTGGAACTCTGTCGTCA 
CTGTCTTGTCTITACTTGCAGTAATGATCTTTATCCCTATGGCAGCCTACTCCATCGCTCGCAA 

AAAAGCCTTTACCATCATGTATACCCTCTTAATCCTCGGAATCTTCGTACCTTTCCAAG

CGGTrATGATGAGTAAACTCGGTTTGGCTAATACCTTTGGTTTGATCTTGCTCTACTTGACCTA 
ACCCTCTTTCrCTATGTTGGCTATATCAAAATCTCGATTCCAGAAAGTCTGGATGAAGCAGCAGAGATCGATGGGG 
CTAATCAATTTACAACCTATTTCCGCATCATCTTCCCAATGATGAAACCGATGCATGCGACAACCATGATCATCAA 
TGCCCTITGGTTCTGG AATG ACITC ATGTTG CC ACTCCTTGTCTTG AACCGGG ATTCC A AAATGTGG ACTCTG CCTT 

TGTTCCAATACAACTACGCAGGCCAATATTTCAACGACTACGGACCAAGCTTTGCCTCTTACGTGGTCGGCATTAT
CAGTATCACCATTGTCTATCTCTTCTTCCAACGCCATATCATTTCAGGAATGAGCAACGGGGCAGTGAAGTAA 

4119.4 

ATGAAAAGTATTCTTCAGAAAATGGGGGAGCATCCGATGCrGCTl'CnU^ 
TTCTrGCACAAAATTGGATGGGTCrrGTGGCTTCAGTAGGAATGTTTCTATTTACTA'r

TCGATTTTATCCCATAAATTCITTCGATTGATTTTGCAGTTTGT 

AGTTTAGAACATTTCCAAATTGTGAAGAAATTTAACTATGCITTTCTTTC^ 

ACCGGGCAGAAGTGACCTTCTTTAATCCTAATTATTATGGAATTATTTGTTGTTTCT 

CTGTTTACAACGACCAAGTTGAATTGGTTGAAAGTATTCTGTGTGATTGCAGGCTTTGTTAATCT 
CTTTACTCAAAATCGAACTGCCTTTCCTGCTATTATCGCTGGAGCA

GGAAGGCCTTTTGGCTTAGTATTGGGGTCTTCGCGATTGGTTTGAGTTTCCTCTm 

ATGGGTACITTAGACTCTTCrATGGAAGAACGCATTTCTATCTGGGATC 

CTTTTTGGGGTGAAGGGCCATTGACCTATATGAACTCTTATCCTCGGATACATGC TCCTT ATCATGAACATGCCCA 
CAGTCTITATATTGATACGATTCTGAGTTACGGAATTGTGGGGACTATTTTATT AGTT TTGTCT TCT 
TTCGCTTGATGATGGATATGAGTCAGGAGTCGGGGAAACGTCCGATTATCGGCCT TTAT CTATCTTTCCTTACAGT
GGTTGCTGTGCACGGAATTTTTGACTTGGCTCTCTTCTGGATTCAGTCAGGCTTTA 
GTATTCCATTGGAGCATCGAATGTTGGTATCGGACATGACGGATTAA 

4120.1 

ATGTCAAAGATGGATGTTCAGAAAATCATTGCACCGATGATGAAGTTTGTGAATATGCGTGGCATTATAGCTCTAA
AAGATGGGATGTTAGCAATTTTGCCATTGACAGTAGTTGGTAGTTTGTTCTTGATTATGGGACAATTGCCGTTC^ 
AGG ATTAAAT AAG AGCATTGCT AGTGTTTTTGG AG CTAATTGG AC AG AGCCGTTTATG CAAGTATATTC AGG A ACT 
TTTGCTATTATGGGTCTAATTTCTTGTTTTTCAATTGCCTATTCTTATGCT 
AGCTGGAGTTCTATCTGTATCTGCATTCTTTATTTTGCrAAGATCATCTTATATCCCT 

GGGACGCTATTAGTAAAGTTTGGTTTGGAGGCCAAGGAATTATCGGTGCTATCATTATAGGTTTGGTAGTAGGAAG
TATTTATACCTTCTTTATAAAGAGAAAAATTGTTATTAAGATGCCAGAACAAGTTCCACAAGCTATTGCCAAACA^ 
TTTGAAGCAATGATTCCAGCATTTGTAATTTTCTTATCTTCTATGATTGTATAT^ 

TGGCGGAACATTCATAGAAATGATTTATTCTGCTATTCAAGTTCCGTTGCAAGGTTTAACTGGATCTTT 
GCTATTGGAATTGCATTCTTTATATCATTITrGTGGTGGTTTGGTGTTCATGGG 

GACAGCTCrGCrTTTATCTAATCrrGATGCTAATAAAGCTATGTTAGCCT

GC ACATATTGTTACTCAACAA'rn'n'AG ATTCA 1' IT l'l AATTCTATCAGGTTCAGGG ATTAC GTTTGGTCTTGTAGT 

TGCCATGCITmGCAGCAAAATCAAAACAATACCAAGCCTrAGGAAAAGTTGCAGCTT^ 

GT AAATG AG CC AGTTGT ATTTGG ATTTCCG ATTGTC ATG AATCC AGTT ATGTTTGTACCTTTC ATTCTTGTTCCTGT 

ACTTGCAGCrGTGATAGTATATGGAGCTATTGCAACAGGTTTCATGCAGCCATTCTCAGGGGTAACATTGCCTTGG 

AGTACACCAGCTATTTTATCAGGATTTTTGGTGGGTGGATGGCAAGGAGTTATTACTCAGCTGGTGATATTAGCGA
TGTCTACATTGGTTTATTTTCCATTCITTAAAGTACAGGATCGTTTAGCITACCAAAA 

4121.2 

ATGAAGAAAAAGGACTTAGTAGACCAACTAGTCTCAGAGATCGAGACGGGGAAAGTCAGGACACTGGGAATATA 
CGGTCATGGAGCTTCAGGTAAATCAACCTTTGCACAGGAATTGTACCAAGCT
CTAGAGACAGATCCTTATATCACCTCAGGACGCCATCTCGTAGTACCCAAGGACGCGCCGAATCAAAAGGTGACA
GCCAGTCTGCCAGTGGCGCATGAACTGGAGAGTTTGCAGAGAGATATCCTTgCTTGCAGGCGGGTATGGATGTCTT 
GA 

4122.1 

ATGAAGAAAAGATACCTAGTCTTGACAGCTTTGCTAGCCTTGAGTCTAGCAGCTTC 

AATGAAGATGGAGAAACTAAGACAGAACAGACAGCCAAAGCTGATGGAACAGTCGGTAGTAAGTCTCAAGGAGC 
TGCCCAGAAGAAAGCAGAAGTGGTCAATAAAGGTGATTACTACAGCATTCAAGGGAAATACGATGAAATCATCGT 

AGCCAACAAACACTATCCATTGTCTAAAGACTATAATCCAGGGGAAAATCCAACAGCCAAGGCAGAGTTGGTCAA
ACTCATCAAAGCGATGCAAGAGGCAGGTTTCCCTATTAGTGATCATTACAGTGGTTTTAGAAGTTATGAAACTCAG 
ACCAAGCTCTATCAAGATTATGTCAACCAAGATGGAAAGGCAGCAGCTGACCGTTACTCTGCCCGTCCTGGCTAT 
AGCGAACACCAGACAGGCTTGGCCmGATGTGATTGGGACTGATCGTGATTTGGTGACAGAAGAAAAAGCAGCC 
CAATGGCTCTTGGATCATGCAGCTGATTATGGCTTTGTTGTCCGTTATCTCAAAGGCAAGGAAAAGG 

ATATGGCTGAAGAATGGCACCTGCGTTATGTAGGAAAAGAAGCTAAAGAAATTGCTGCAAGTGGTCTCAGTTTGG
AAGAATACTATGGCTTTGAAGGCGGAGACTACGTCGATTAA 

4125.6 

ATGCGTAAATTCTTAATTATTTTGTTGCTACCAAGTTTTTTGACCATTT 

TCGTCTATACTTCGAAAGAAATTTATTACCTTTCACAATCTGACITTGGTATT^

CCCATGGTTTATGGAGAGGTTCCTGTTTATGCGAATGAAGATTTAGTAGTGGAATCTGGGAAATTGACTCCCAAAA 
CAAGTTTTCAAATAACCGAGTGGCGCTTAAATAAACAAGGAATTCCAGTATTTAAGCTATCAAATCA 
AGCTGCGGACAAACGATTTTTATATGATCAATCAGAGGTAACTCCAACAATAAAAAAAGTATGGTTAGAATCTGA 
CTTTAAACTGTACAATAGTCCTrATGATTTAAAAGAAGTGAAATCATCCTTATCAGCTTATTCGCAAGTATC 

GACAAGACCATGTTTGTAGAAGGAAGAGAATTTCTACATATTGATCAGGCTGGATGGGTAGCTAAAGAATCAACT
TCTGAAGAAGATAATCGGATGAGTAAAGTTCAAGAAATGTTATCTGAAAAATATCAGAAAGATTCTTTCT 
ATGTTAAGCAACTGACTACTGGAAAAGAAGCTGGTATCAATCAAGATGAAAAGATGTATGCAGCCAGCGTTTTGA 
AACTCTCTTATCTCTATTATACGCAAGAAAAAATAAATGAGGGTCTTTATCAGTTAGATACGACTGTAAAATACCT 
ATCTGCAGTCAATGATTTTCCAGGTTCTTATAAACCAGAGGGAAGTGGTAGTCTTCCTAAAAAAGAAGATAATAA 

AGAATATTCTTTAAAGGATTTAATTACGAAAGTATCAAAAGAATCTGATAAT

TACATTTCAAACCAATCTGATGCCACATTCAAATCCAAGATGTCTGCCATTATGGGAGATGATTGGGATCCAAAAG 
AAAAATTG ATTTCTTCT AAG ATGG CCG G G AAGTTT ATG G AAGCT ATTTAT AATC AAAATG G ATTTGTGCT AG AGTC 

TTTGACTAAAACAGATTTTGATAGTCAGCGAATTGCCAAAGGTGTTTCTGTTAAAGTAG 

CTC AT AAAATTGG AG ATGCGG ATG AATTT AAG C ATG AT ACGGGTGTTGTCT ATGC AG ATTCTCCATTT ATTCTTTCT 

ATTTTCACTAAGAATTCTGATTATGATACGATTTCTAAGATAGCCAAGGATGm

4125.7 

ATGAAAAAACAAAATAATGCrmAATTAAAAATCOTTTCTATGGTTATTATT^ 

CCAGTATTTCTATTCTGGGAATAACTCAGGAGGAAGTCAGCAAATCAACTATACTGAGTTGGTACAAGAAATTAC 
CGATGGTAATGTAAAAGAATTAACTTACCAACCAAATGGTAGTGTTATCGAAGTTTCTGGTGTCTATAAAAATCCn'
AAAACAAGTAAAGAAGAAACAGGTATTCAGTTTTTCACGCCATCTGTTACTAAGGTAGAGAAATTTACCAGCACT 
ATrCTTCCTGCAGATACTACCGTATCAGAATTGCAAAAACTTGCTACTGACCATAAAGCAGAAGTAACT 
ATGAAAGTTCAAGTGGTATATGGATTAATCTACTCGTATCCATTGTGCCATTTGGAATTCT 

TCTATGATGGGAAATATGGGAGGAGGCAATGGCCGTAATCCAATGAGTTTTGGACGTAGTAAGGCTAAAGCAGCA 
AATAAAGAAGATATTAAAGTAAGATTITCAGATGTTGCTGGAGCTGAGGAAGAAAAACAAGAACTAGTTGAAGTT
GTTGAGTTCTTAAAAGATCCAAAACGATTCACAAAACTTC 

CTCCGGGGACAGGTAAAACTTTGCTrGCTAAGGCAGTCGCTGGAGAAGCAGGTGTTCCATrCTTTAG 
TTCTGACTTIX3TAGAAATGTTTGTCGGAGTTC 

GCACCAGCTATCATCTTTATCGATGAAATTGATGCTGTTGGACGTCAACGTGGAGTCGGTCTCGGCGGAGGTAATG 
ACGAACGTGAACAAACCTTGAACCAACnTrTGATTGAGATGGATGGTTTTGAGGGAAATGAAGGGATTATCGTCA
TCGCTGCGACAAACCGTTCAGATGTACTTGACCCTGCCCTTn'GCGTCCAGGACGTTTTGATAGAAAAGTATTG^ 
TGGTCGTCCTGATGTTAAAGGTCGTGAAGCAATCTTGAAAGTTCACGCTAAGAATAAGC 
CTTTAGCAGAAGATGTTGATTTGAAATTAGTGGCTCAACAAACTCCAGGCTTTGTTGCT 
CTrGAATGAAGCAGCrrTAGTTGCTGCTCGTCGCAATAAATCGATAATTGATGCITCAGATATTGATGAAGC 
GATAGAGTTATTGCrGGACCTTCTAAGAAAGATAAGACAGTTTCACAAAAAGAACGAGAATTGGTTGCTTACCAT
GAGGCAGGACATACCATTGTTGGTCTAGTCTTGTCGAATGCTCGCGTTGTCCATAAGGTTACAATTGTACCACGCG 
GCCGTGCAGGCGGATACATGATTGCACTTCCTAAAGAGGATCAAATGCTTCTATCTAAAGAAGATATGAAAGAGC 
AATTGGCTGGCTTAATGGGTGGACGTGTAGCTGAAGAAATTATCrTTAATGTCCAAACCACAGGAGCTTCAAACG 
ACTTTGAACAAGCGACACAAATGGCACGTGCAATGGTTACAGAGTACGGTATGAGTGAAAAACTTGGCCCAGTAC 
AATATGAAGGAAACCATGCTATGCTTGGTGCACAGAGTCCTCAAAAATCAATTTCAGAACAAACAGCTTATGAAA
TTGATGAAGAGGTTCGTTCATTATTAAATGAGGCACGAAATAAAGCTGCTGAAATTATTCAGTCAAATCGTGAAAC 
TCACAAGTTAATTGCAGAAGCATTATTGAAATACGAAACATTGGATAGTACACAAATTAAAGCTCTTTACGAAAC 
AGGAAAGATGCCTGAAGCAGTAGAAGAGGAATCTCATGCACTATCCTATGATGAAGTAAAGTCAAAAATGAATGA 
CGAAAAATAA 
4125.10

ATGAGGGAACCAGATTTTTTAAATCATTTTCTCAAGAAGGGATATTTGAAAAA 
TTTCTGGTGGATTAGATTCCATGTTTCTATTTAAGGTATTGTCTACTTATCAAAAAGAGTTAG 
CTAGCTCATGTGAATCATAAGCAGAGAATTGAATCAGATTGGGAAGAAAAGGAATTAAGGAAGTTGGCTGCTGAA 
GCAGAGCnTCCTATTTATATCAGCAATTTTTCAGGAGAATTTTCAGA^^

TTTTTCAAGAGGTCATGAAAAAGACAGGTGCGACAGCTTTAGTCACTGCCCACCATGCTGATGATCAGGTGGAAA 
CGATTTTTATGCGCTTGATTCGAGGAACTCGCTTGCGCTATCTATCAGGAATTAAGGAGAAGCAAGTAG 
G ATAGAAATCATTCGTCCCTTCITGCATTTTCAGAAAAAAG ACTTTCCATCAA IT1T I CACTTTG AAGAT ACATCA 
AATCAGGAGAATCATTATTTTCGAAATCGTATTCGAAATTCTTACTTACCAGAATTGGAAAAAGAAAATCCTCGAT 
TTAGGGATGCAATCTTAGGCATTGGCAATGAAATTTTAGATTATGATTTGGCAATAGCrGA^
TAATGTGGAAGATTTACAGCAGTTATTTTCTTACTCTGAGTCT^ 

GTTTTCCAGATTTGAATCTTACAAAAGCTCAGTTTGCTGAAGTTCAGCAGATTTTAAAA 

TCATCCGATTAAAAATGGCTATGAATTGATAAAAGAGTACCAACAGTTTCAGATTTGTAAAATCAGTCCGCAGGCT 
GATGAAAAGGAAGATGAACTTGTGTTACACTATCAAAATCAGGTAGCTTATCAAGGATATTTAI'IUTCTlTrGGAC 
TTCCATTAGAAGGTGAATTAATTCAACAAATACCTGTTTCACGTGAAACATCCATACACATTCGTCATCGAAAAAC
AGGAGATGTTTTGATTAAAAATGGGCATAGAAAAAAACTCAGACGTTTATTTATTG 
ATTTGAAAATCCCTATGGAAAAGAGAAACTCTGCTCTTATTATTGAGCAATTTGGTGAAATTGTCT 
AATTGCG ACC AAT AATTTG AGTAAAAAAACG AAAAATG AT AT AATG AA C ACTGTACTTT AT AT AG AAAAAAT AG A 
TAGGTAA 

4126.1 

ATGAAGCGTTCTTCTCTTTTAGTTAGAATGGTTATTTC 

ACTrTCTACTATCAATCAAGTTCTrCAGCCATTXjAGGCCACCATTGAGGGCAACAGCCAAA 
ACTAGCCACTTTATTCAGTCTTATATCAAAAAACTAGAAACCACCTCGACTGGTTTGACCCAGCAGACGG 

TGGCCTATGCTGAGAATCCCAGTCAAGACAAGGTCGAGGGAATCCGAGATTTGTTTTTGACCATCTTGAAGTCAGA
TAAGGACTTGAAAACTGTTGTGCTGGTGACCAAATCTGGTCAGGTCATTTCTACAGATGACAGTG TGCA GATGAA 
AACTTCCTCTGATATGATGGCTGAGGATTGGTACCAAAAGGCCATTCATCAGGGAGCTATGCCTGTTTTGACTCCA 
GCTCGTAAATCAGATAGTCAGTGGGTCATTTCTGTGACTCAAGAACTTGTTGATGCAAAGGGAGCCAAT CTTGG TG 
TGCTTCGTTTGGATATTTCTTATGAAACTCTGGAAGCCTATCrCAATCAACTCCAGTTGGGGCAGCAGGG 

CTTCATTATCAATGAAAACCATGAATTTGTCTACCATCCTCAACACACAGTT^

GCTATGAAACCCTACATCGATACAGGTCAGGGTTATACTCCTGGTCACAAATCCTACGTCAGTCAAGAGAAGATT 
GCAGGAACTGATTGGACGGTGCTTGGCGTGTCATCATTGGAAAAGTTAGACCAGGTTCGGAGTCAGCTCTTGTGG 
ACCnTGCTTGGGGCCAGTGTCACATCTCTTCITC^ 

TCCTTrGAAGGATTTGAGAGAAACCATGTTGGAAATTGCTTCTGGTGCTCAAAATCnTCGTGCCAA 
GCCTATGAACTGAGAGAAGTAACTCGCCAATTTAATGCTATGTTGGATCAGATTGATCAGTTGATGGTAGCTATTC
GTAGCCAGGAAGAAACGACCCGTCAGTACCAACTTCAAGCCCTTTCGAGCCAGATTAATCCACATTTCCTCT 
CACTTTGGACACCATCATCTGGATGGCTGAATTTCATGATAGTCAGCGAGTGGTGCAGGTGACCAAGTCCTTGGCA 
ACCT ATTTCCG CTTGG CGCTC AATC AAGG C AAGG ACTTG ATTTGTCTCTCTG ACG AAATCAATC ATGTC CGCC AGT 
ATCTCTTT ATCC AG AAA C AA CG CT ATG G AG AT AAG CTG G AAT A CG AAATT AATG AAAAT GTTG CCTTTG AT AATTT 
AGTCTTACCCAAGCTGGTCCTACAACCCCTTGTAGAAAATGCTCTTTACCATGGCATTAAGGAAAAGGAAGGTCA
GGGCCATATTAAACTTTCTGTCCAGAAACAGGATTCGGGATTGGTCATCCGTATTGAGGATGATGGCGTTGGCTTC 
CAAGATGCTGGTGATAGTAGTCAAAGTCAACTCAAACGTGGGGGAGTTGGTCTTCAAAATGTCGATCAACGGCTC 
AAACTTCATTTTGGAGCCAATTACCATATGAAGATTGATTCTAGACCCCAAAAAGGGACGAAAGTTGAAATATAT 
ATAAATAGAATAGAAACTAGCTAA 

4126.7 

ATGAAGCGTTCTTCTCTTTTAGTTAGAATGGTTATTTCCATCTTTCTGGTCTr^ 

ACTTTCTACrATCAATCAAGTTCTTCAGCCATrGAGGCCACCATrGAGGGCAACAGCCAAACGACCATCAGCCAG 

ACTAGCCACTTTATTCAGTCTTATATCAAAAAACTAGAAACCACCTCGACT^

TGGCCTATGCTGAGAATCCCAGTCAAGACAAGGTCGAGGGAATCCGAGATTTGTVriTGACCATCTrGAAGTCAGA 
TAAGGACTTGAAAACTGTTGTGCTGGTGACCAAATCTGGTCAGGTCATTTCTACAGATGACAGTGTGCAGATGAA 
AACTTCCTCTGATATGATGGCTGAGGATTGGTACCAAAAGGCCATTCATCAGGGAGCTATC 
GCTCGTAAATCAGATAGTCAGTGGGTCATTTCTGTCACTCAAGAACTTGTTGATGCAAAGGGAGCCAATCTTGGTG 

TGCTTCGTTTGGATATTTCTTATGAAACTCTGGAAGCCTATCTCAATCAACTCCAGTTGGGGCAGCAGGGCTT^

CTTCATTATCAATGAAAACCATGAATTTGTCTACCATCCTCAACACACAGTTTATAGTTCGTCTAGCAAAATGGAG 
GCTATGAAACCCTACATCGATACAGGTCAGGGTTATACTCCTGGTCACAAATCCTACGTCAGTCAAGAGAAGATT 
GCAGGAACTGATTGGACGGTGCTTGGCGTGTCATCATTGGAAAAGTTAGACCAGGTTCGGAGTCAGCTCTTGTGG 
^T^GGGG CC AGTGTC AC ATCTCTTCTTGTCTGTCTCTG CTT AGTGTGGTTC AGTCTT AAACG CTGG ATTG C 

TCCTTTGAAGGATTTGAGAGAAACCATGTTGGAAATTGCTTCTGGTGCTCAAAATCTTCGT^

GCCTATGAACTGAGAGAAGTAACTCGCCAATTTAATGCTATGTTGGATCAGATTGATCAGTTGATGGTAGCTATTC 
GTAGCCAGGAAGAAACGACCCGTCAGTACCAACTTCAAGCCCTTrCGAGCCAGATTAATCCACATTTCCTCTAT^ 
CACTTTGGACACCATCATCTGGATGGCTGAATTTCATGATAGTCAGCGAGTGGTGCAGGTGACCAAGTCCTTGGCA 
ACCTATTTCCGCTTGGCGCTCAATCAAGGCAAGGACTTGATTTGTCTCTCTGACGAAATCAATCATGTCCGCCAGT 

ATCTCITTATCCAGAAACAACGCTATGGAGATAAGCTGG AAT ACGAAATT AATG AAAATGTTGCCTTTGA
AGTCTTACCCAAGCTGGTCCTACAACCCOT

GGGCCATATTAAACTTTCT.GTCCAGAAACAGGATTCGGGATTGGTCATCCGTATTGAGGATGATGGCGTTGGCTTC 
CAAGATGCTGGTGATAGTAGTCAAAGTCAACTCAAACGTGGGGGAGTTGGTCTTCAAAATGTCGATCAACGGCTC 
AAACTTCATTTTGGAGCCAATTACCATATGAAGATTGATTCTAGACCCCAAAAAGGGACGAAAG^ 
ATAAATAGAATAGAAACTAGCTAA

4127.4 

ATGTTTTTTAAATTATTAAGAGAAGCTCTTAAAGTCAAGCAG 

TTTGGTCTTTCGTATCGGAACTAGCATTACAGTTCCTGGTGTGAATGCCAATAGCTTGAATGCTTTAAGTGG 
CCTTCTTAAAC\TGTTGAGCTTGGTGTCGGGGAATGCCCTAAAAAACrm

CTATATCACCGCTTCTATTGTTGTCCAACTCTTGCAAATGGATATTTTACCCAAGm 

GGGGAAGTAGGTCGAAGAAAATTGAATCAAGCTACTCGTTATATTGCTCTAGTTCTCGCTTT^ 

GGATTACAGCTGGTTTTAATACCTTGGCTGGAGCTCAATTGATTAAAACTGCm 

GATTGGTATCATCTTAACAGCTGGTAGTATGATTGTCACTTGGTTGGGTGAGCAAATTACAGATAAGGGATACGGA 
AACGGTGTTTCCATGATTATCTTTGCCGGGATTGTTTCCT

ACTTTGTGAACGTCCCAAGTAGCCGTATCACTTCATCTATCATTTTCGTAATCATTTTGATTATTACTGTAT^ 
ATTATTTACTTTACAACTTATGTTCAACAAGCAGAATACAAAATTCCAATCCAATAT ACT 

CTCCATCTAG CTCTTACCTTCCGTTAAAAGTAAACCCTG CTGG AGTTATCCCTGTTATCTTTGCCAGTTCG ATTACT 
GCAGCGCCTGCGGCTATTCTTCAGTTTTTGAGTGCCACAGGTCATGATTGGGCTTGGGTAAGGGTAGCACAAGAGA 
TGTTGGCAACTACTTCTCCAACTGGTATTGCCATGTATGCTTTGTTGATTATTCTCT^

TACAGATTAATCCTGAAAAAGCAGCAGAGACCTACAAAAGAGTGGTGCCTATATCCATGGAGTTCGTCCTGGTAA 
AGGT AC AG AAG AAT AT ATGTCT AA ACTTCTTCGTCGTCTTG CAACTGTTG GTTC C CTCTTCCTTG GTGTG A 



4127.5

ATGGATATTAGACAAGTTACTGAAACCATCGCCATGATTGAGGAGCAAAACTTCGATATTAGAACCATTACCATG 
GGG ATTTCTCTTTTGG ACTGTATCG ATCC AG AT ATCAATCGTGCTG CGG AG AAAATCT ATC AAAAAATTACG ACAA 
AGGCGGCTAATTTAGTAGCTGTTGGTGATGAAATTGCGGCTGAGTTGGGAATTCCTATCGTTAATAAGCGTGTATC 
GGTGACACCTATTTCTCTGATTGGGGCAGCGACAGATGCGACGGACTACGTGGTTCTGGCAAAAGCGCTTGATA^ 

GGCTGCGAAAGAGATTGGTGTGGACTTTATTGGTGGTTTITCTGCCr^

GAGATTCTCATCAATTCCATrCCTCGCGCTTTGGCTGAGACGGATAAGGTCTGCTCGTCAGTCAAT ATCG GCTCAA 
CCAAGTCTGGTATTAATATGACGGCTGTGGCAGATATGGGACGAATTATCAAGGAAACAGCAA ATCTT TCAGATA 
TGGGAGTGGCCAAGTTGGTTGTATTCGCTAATGCTGTTGAGGACAATCCATTTATGGCG GGTG CCTTTCATGGTGT 
TGGGGAAGCAGATGTTATCATCAATGTCGGAGTTTCTGGTCCTGGTGTrGTGAAACGTGCTTTGGAAAAAGTTCGT 

GGACAGAGCTTTGATGTAGTAGCCGAAACAGTTAAGAAAACTGCCTTT

GTCAAATGGCCAGTGAGAGACTGGGTGTGGAGTrTGGTATTGTGGACTTGAGTTTGGCACCAACCCCTGCGGTTGG 
AGACTCTGTGGCACGTGTCCTTGAGGAAATGGGGCTAGAAACAGTTGGCACGCATGGAACGACGGCTG CCTTG GC 
CCTCTTGAACGACCAAGTTAAAAAGGGTGGAGTGATGGCCTGCAACCAAGTCG 
CCTGTTTCTGAGGATGAAGGAATGATTGCTGCAGTGCAAAATGGCTCTCTTAATTTAGAAAAACT 

CGGCTATCntjTTCTGTTGGATTGGATATGATTGCCATCCCAGAAGATACXjCCTGCTGAAACTATTGCGGCT

TGCXjGATGAAGCAGCAATCGGTGTTATCAACATGAAAACAACAGCTGTTCGTATCATTCCCAAAGGAAAAGAAGG 

CGATATGATTGAGTTTGGTGGTCrATTAGGAACTGCACCCGTTATGAAGGTTAATGGGGCITC 

ATCTCTCGCGGTGGACAAATCCCAGCACCAATTCATAGTTTTAAAAATTAA 

4128.1

ATGACACAGATTATTGATGGGAAAGCTTTAGCGGCCAAATrGCAGGGGCAGTTGGCTGAAAAGACTGCAAAATTA 
AAGGAAGAAACAGGTCTAGTGCCTGGTTTGGTAGTGATTTTGGTTGGGGACAATCCAGCCAGCCAAGTCTACGTT 
CGCAACAAGGAGAGGTCAGCCCTTGCGGCTGGTrTCCGTAGCGAAGTAGTACXjGGTTCCAGAGACCATTACTCAA 
GAGGAATTGTTAGACCTGATTGCTAAATACAATCAGGATCCAGCTTGGCATGGGATTTTGGrrCAGTT 
C AAAAC A C ATTG ATG AAG AGG CGGTTCT ATTG GCT ATTG ACCC AG AAAAGG ATGTGG ATG GTTTCC ATCCTCT AA

ACATGGGGCGTCTTTGGTCTGGTCATCCAGTCATGATTCCTrCGACACCGGCAGGAATTATGGAAATG'TTCCATGA 
ATATGGGATTGACTTGGAAGGTAAAAATGCAGTCGTCATCGGTCGATCCAATATTGTCGGAAAACCTATGGCCCA 
GCTTCTTTTGGCAAAGAATGCAACAGTAACCITGACTCACTCACGTACTC 

GCAGATATTCTGGTTGTTGCAATCGGTCGTGCCAAGTTTGTGACTGCTGACTTTGTCAAACCAGGTGCGGTAGTCA 
TTGACGTTGGGATGAACCGCGATGAAAATGGTAAGCTCTGTGGGGATGTTGATTATGAGGCGGTTGCCCCACTTGC
TAGCCACATTACGCCAGTCCCTGGAGGTGTCGGTCCTATGACCATTACTATGCTGATGGAGCAAACCTATCAGGCA 
GCACTTAGGACATTGGATAGAAAATAA 



ATGTCT AAATTTAATCGTATTCATTTGGTGGTACTGGATTCTGTAGGAATCGGTGCAGCACCAGATGCT AAT AACT

TTGTCAATGCAGGGGTTCCAGATGGAGCITCTGACACACTGGGACACATTTCAAAAACAGTTGGTTTGAATGTC^ 
AAAC ATGG CT AAAAT AGGTCTTG G A AAT ATTCCTCGTG A A ACTCCTCTT A AG ACTGT AGC AGCTG AAAG C AATCC 
AACTGGATATGCAACAAAATTAGAGGAAGTATCTCrTGGTAAGGATACTATGACTGGACACTGGGAAATCATGGG 
ACTCAACATTACrGAGCCTTTCGATACTTTCTGGAACGGATTCCCAGAAGAAATCCTC 

TCAGGACGCAAGGTTATTCGTGAAGCCAACAAACCTTATTCAGGAACGGCTGTTATCTATGATTTTGGACCACGTC
AGATGGAAACTGGAGAGTTGATTATCTATACTTCAGCTGA

TCCTTTGGATGAATTGTACCGTATCTGTGAATACGCTCGTTCGATTACCCTTGAGCGTCCTGCCCTTCTTGGTCGCA 
TCATTGCTCGCCCTTATGTAGGTGAACCAGGTAACTTCACTCGTACGGCAAACCGTCGTGACTTGGCTGTATCTCC 
ATTTTTCCCAACTGTTTTGGATAAATTGAATGAGGCTGGTATCGATACrTATGCT 
TTTAACGGTGCTGGTATCAACCATGACATGGGTCACAACAAGTCAAATAGTCATGGAATTGATACACTATTGAAG
ACTATGGGACTTGCTGAGTTTGAAAAAGGATTCTCATTCACAAACCTAGTTGACTTTGATGCCCTT^ 
GTCXjTAATGCTCACGGTTACCGTGATTGCTTGCATGAGTTTGATGAACGCTTACCTGAAATTATCGCAGCT 
AGAGAATGACCTTCTCTTGATTACTGCGGACCATGGAAATGACCCAACGTATGCAGGAACGGATCACACTCGGGA 
ATATATTCCATTGTTGGCCTATAGCCCTGCCTTTAAAGGAAATGGTCTCATTCCAGTAGGACATTTTGCAGATA 
CAGCGACTGTTGCCGATAACnTTGGTGTGGAAACTGCTATGATTGGGGAAAGTTrCTTAGATAAATTGGTATAA

4129.2 

ATGTTTATrTCCATCAGTGCTGGAATTGTGACATTTTTACTAACTTTAGTAGAAATTCC 

TAGAAAGGCGCAAA TTACA GGCCAGCAGATGCATGAGGATGTCAAACAGCATCAGGCAAAAGCTGGGACTCCTA 
CAATGGGAGGTTTGGTTrrCTTGATTACTTCTGTTTrGGTTG

ATAATGTGGG AATG ATTTTGTTCATCTTGGTCTTGTATGGCTTGGTCGG A' 1' i 'IT I AG ATG ACTTTCTCAAGGT CTTT 

CGTAAAATCAATGAGGGGCTTAATCCTAAGCAAAAATTAGCTCTTCAGCTTCT^ 

TCTAT GAGCGCGGTGG CGATAT CCTGTCTGTCTTTGGTTATCCAGTTC 

CTmCTGGCTAGTCGGTTTTTCAAACGCAGTAAACTTGACAGACGGTGTTGACGGTTTA 
GATTAGTTTOTCTGCCTATGGAGTTATTGCCTATGTGCAAGGTCAGATGOATATTCT

TTGGTGGTTTGCTCGGTTTCTTCATCTTTAACCATAAGCCTGCCAAGGTCrrTATGGGTGATGTGGGA^ 

CTAGGTGGGATGCTGGCAGCTATCTCnV\TGGCT 

TTTTTGAAACAACTT CTGTT ATGATGCAAGTCAGTTATTTCAAACTGACAGGTGGTAAACGT^ 
GCCTGTACATCACCATTTTGAGCTTGGGGGATTGTCTGGTAAAGGAAATCCTTGGAGCGAGTGGAAGGTTGACTTC 
TTCTTTTGGGGAGTGGGACrrCTAGCAAGTCTCCTGACCCTAGCAATTTTATATTTG

4133.1 

TTGTTTAAGAAAAATAAAGACATTCTTAATATTGCATTGCCA 

G AATGGTG G AC AGTT ATTTGGTTG CTC ATTT AG G ATTG AT AGCT ATTTC AGG G GTTTC AGT AG CTG GTAATATTAT 
C ACCATTT ATC AG GCG ATTTTCATCG CTCTG GG AGCTG CTATTTCCAGTGTT ATTTC AAAAAGCATAGG G C AG AAA

GACCAGTCGAAGTTGGCCrATCATGTGACTGAGGCGTTGAAGATTACCTTACTATTAAGTTTCCTn^ 
GTCCATCTTCGCTGGGAAAGAGATGATAGGACrTTTGGGGACGGAGAGGGATGTAGCTGAGAGTGGTGGACTGTA 
TCTATCITrGGTAGGCGGATCGATTGTTCTCTTAGGTTTAATGACT 

AATCC ACGTCTGCCTCTCTATGTTAGTTTTTTATCC AATG CCTTG AAT ATT C 1 ' IT lTl 'C AAGTCTAGCT ATTTTTGTT 
CrGGATATGGGGATAGCTGGTGTTGCTTGGGGGACAATTGTGTCT^

AATTAAAACTGCCrrATGGGAAGCCAACrTTTGGTTTAGATAAGGAACTGTTG 

AGAGCGACTTATGATGAGGGCTGGAGATGTAGTGATCATTGCCTTGGTCGTTTCTTTTGGGACGGAGGCAGTTGCT 
GGGAATGOVATCGGAGAAGTCrTGACCCAGTTTAACTATATGCCrGCCnTGGCGTCGCTACGGCAACGGTCATGC 
TGTTGGCCCGAGCAGTTGGA GAGG ATGATrGGAAAAGAGTTGCTAGTTrGAGTAAACAAA ^ 
GTTCCTCATGTTGCCCCTCTCCITTAGTATATATGTCTTGG

TAGCGGTGGAGGCTAGTGTTCTAGTGACACTGTTTTCACTACrTGGGACCCCrATGACGACAGGAACAGTCATCTA 
TACGGCAGTCTGGCAGGGATTAGGAAATGCACGCCTCCCITnTATGCGACAAGTATAGGAATGTGGTGTATCCGC 
ATTG GGAC AGGATATCTGATGGGGArrGTGCTTGGTTGGGGCTTGCCTGGTATTTGGGCAGGGTCTCT 
ATGGTTTTCGCTGGTTATTTCTACGCTATCGTTACCAGCGCTATATGAGCTTGAAAGGATAG 

45 

4135.2 

ATGCAAACTCAAGAAAAACACTCGCAAGCAGCCGTTCTTGGCTrGCAGCACTTACrAGCCATGTACTCA^ 
ATCCTGGTTCCCATCATGATTGCGACAGCCCTTGGCTATTCAGCTGAGCAGTTGACCTACCTGATTTCTAC 
TCTTCATGTGTGGGGTGGCAACCTrCCTCCAACrCC^ 
TGGAGTTGCATTCCAGTCGGTCGCTCCCTTGATTATGATTGGGCAAAGCCATGGTAGTGGCGCTATGTTTGGTGCC
CTTATCGCATCTGGGATTTACGTGGTTCTTGTTTCAGGCATCTTCT 

AACAGGATCTGTTATTACCACGATrGGTTTAACCITGATCCCTGTCGCTATTGGAAATATGGGAAATAACGTTCCA 

GAGCCAACTGGTCAAAGTCT CTTGC TTGCAGCTATrACTGTTCTGATTATCCrcnTGATCAACA 

GATTTATCAAGTCTATCTCTATTTTGATTGGTCTGGTTGTTGGAACTGCCATTGCTGCTACT 

TTCTCrCCTGTTGCGGTAGCrCCACTTGTCCATGTCCCAACTCCACTCTACTTTGG

ATCTATTGTCATGATGTGTATCATCGCAACGGTGTCTATGGTTGAGTCAACTGGTGTTTATCrGGCCTTGTCTGATA 
TCAC AAAG GATCCAATCGACAGCACGCGCCTTCGCAACGGATACCGCGCAGAAGGTTTGGCCGTACTTCTCGGAG 
GAATCTTTAACACCn'CCCriTACACCGGATTTTCACAAAACGTTGGTTTGGT^ 
CCrGCCAATCTACTACGCAGCTGGTTTCCTGGTTCTCCTTGGACTGCTTCCTAAGTTTGGCGC 

TTCCAAGCT CCGT CCTCGGTGGTGCCATGCTGGTAATGTTTGGTTTTGTATCAATTCAAGGGATGCAAATCCTCGC
CCGTGTTGACT TTGCT AACAATGAACACAACnTCCTTATCGCAGCTGTTTCAATCGCTGC 
AACAGTAATCrCnTTGTCAGCATGCCGACAGCCTTCCAAATGTTCTTCTCAAACGGAATCGTCGTAGCCA 
TCG CTATTGTCCTCAATG CCGTATTAAATCATAA A AAG AAATAA 

4136.2
ATGAAAGATAGAATAAAAGAATATTTACAAGACAAGGGAAAGGTGACTGTTAATGATTTGGCTCAGGCTT TGGGA
AAAGACAGTTCCAAGGATTTTCGTGAGTTGATTAAAACCTTGTCCTTAATGGAAAGAAAG CA 
AAGAAGATGGTAGTCTGACATTAGAAATTAAGAAAAAACATGAGATTACCCTCAAGGGGATTTTTCATGCCCATA 
AAAATGGCTTTGGCTTTGTTAGTCTGGAAGGCGAGGAGGACGACCTTTTTGTAGGGAAAAATGATG 
TATTGATGGTGATACCGTCGAGGTAGTGATTAAGAAAGTCGCTGACCGCAATAAGGGAACAGCAGCAGAAGCCAA
AATTATTGATATCCTAGAACACAGTTTGACAACAGTTGTCGGGCAAATCGTTCTGGATCAGGAAAAACCTAAGTAT 
GCTGGCTATATTCGTTCAAAAAATCAGAAAATCAGTCAACCGATTTATGTTAAGAA ACCA GCCCTAAAATTAGAA 
GG AACAG AAGTTCTC AA AGTCTTTATCG ATAAATACCCAAGC AAG A AAC ATG A riTCri" I'GTCGCGAGTGTTCTCG 
ATGTAGTGGGACACTCAACGGATGTCGGAATTGATGTTCTTGAGGTCTTGGAATCAATGGACATTGTATCCGAGTT 
TCCAGAAGCTGTTGTTAAGGAAGCAGAAAGTGTGCCTGATGCTCCGTCTCAAAAGGATATGGAAGGTCGTCTGGA
TCTAAGAGATGAAATTACCTTTACCATTGACGGTGCGGATGCCAAGGACTTGGACGATGCAGTGCATATCAAGGC 
TCTGAAAAATGGCAATCTGGAGTTTGGGGTTCACATCGCAGAT^ 

GACAAGGAAGCCCTTAACCGTGCGACTTCTGTTTACGTGACAGACCGAGTGGTGCCAATGC1TCCAGAACGACTA 

TCAAATGGCATCTGCTCTCTCAATCCCCAAGTTGACCGCCTGACCCAGTCTGCTATTAT 

GGAGATTGATAAACATGGTCGTGTGGTCAACTATACCATTACACAAACAGTTATCAAGACCAGTTTTCGTATGACC
TATAGCGATGTCAATGATATCCTAGCTGGCGATGAAGAAAAGAGAAAAGAATATCATAAAA TTGTA TCAAGTATC 
GAACTCATGGCCAAGCTTCATGAAACTTrAGAAAACATGCGTGTG 

AAGCGAAGATTTTAGTGGATAAACAAGGTAAGCCTGTTGATATCGTTCTTCGGCAGCGTGGTATT GCCGA GCGGA 
TGATTGAGTCTTTTATGTTGATGGCTAATGAAACAGTTGCCGAACATTTCAGCAAGTTG GATT^ 
CGAATTCACGAGGAGCCTAAGGCTGAAAAGGTTCAGAAGTTTATTGATTATGCTTCGAGTTTTGGCTTGCGC

ATGGAACTGCCAGTGAGATTAGTCAGGAGGCACTTCAAGACATCATGCGTGCTGTTGAGGGAGAACCTTATGCAG 

ATGTATTGTCCATGATGCTTCTTCGCTCTATGCAGCAGGCTCGTTATTCGGAGCACAATCACGGCCACTATG 

AGCTGCrGACTATTATACTCACTTTACCAGTCCAATTCGTCGTTATCCAGACCITCTT 

ATTACGGCCGTTCTAAGGAAATAGCAGAGCATTTTGAACAAGTGATTCCAGAGATTGCGACCCAGTCTTCCAACC 
GTGAACGTCGTGCCATAGAAGCTGAGCGTGAAGTCGAAGCCATGAAAAAGGCTGAGTATATGGAAGAATACGTGG
GTGAAGAGTATGATGCAGTTGTATCAAGTATTGTCAAATTCGGTCTCnTrGTCGAATTGCCAAACACAGTTGAAGG 
CTTGATTCACATGACTAATCTGCCTGAATTTTATCATTTCAATGAGCGTGATTTGACTCTT^ 
GGTATCACTTTCCGAGTGGGTCAGCAGATCCGTATCCGTGTTGAAAGAGCGGATAAAATGACTGGAGAGATTGAT 
TTTTCATTCGTACCTAGTGAGTTTGATGTGATTGAAAAAGGCTTGAAACAGTCTAGTCGT 
AGTGGCAGAGGGCGTGATTCAAATCGTCGTTCGGATAAGAAGGAAGA CAAGAG AAAATCAGGACGCTCAAATGA
T AAGCGT AAG C ATTC AC AAAAAG AC AAG AAG AAAAA AGG AAAG AAACCTTTTT ACAAG G AAGT AG CT AAG AAAG 
GAGCCAAGCATGGCAAAGGGCGAGGGAAAGGTCGTCGCACAAAATAA 

4137.2 

ATGGGCACAACAGGATTTACAATAATTGACTTAATTATCITGATTGTT^

CTATTTCTCrAAAAAAGAGATGAAAGGAAAAGAGTTCTTTAAAGGAGATG 
GTATCCATTTTTGCCACAATGCTCAGTCCGATTC 

TTTATGGTTTGCrCAATTAGGGATGGTAGTAGCTATTCCACTGACAATTCGTTTTATCT^ 
TAGACATCGATACGGCATATGATTACTTGGATAAACGTTTTAATTCTAAAGCACnTCGTATTATTTCAGCA 

GTTTATTATTTATCAATTGGGACGTATGTCTATCATTATGTACCT

TTGACATCAATATTTTGATTATTTTGATGGGTGTAGTTGCAATTGTTTATTCITATACrGGTGG 
TTATGGACAGACTTTATTCAAGGTGTGATTCTGATTAGTGGTGTCGTTTTAGCrTTATTTG 
TAAAGGTGGCTTTGGTGCAGTAGCAGAAACATTAGCAAACGGGAAATTCCnTGCrGCAAATGAAAAAC^ 
TCCTAACTTGCTITCAAACrCCATCrrTTTTAATTGTGATGGGTTCAGG 

TCAAGATTTGGTTCAACGTTTTACTACAACACAAAATATTAAGAAACTTAATAAGATG

TTGTCACTTGCAACTGCAACAGTCTmACITGATTGGTACAGGCTTGTACGTATTCTATC^ 
ATAGTGCAGCTAGCAATATCCCTCAAGACCAAATCrTTATGTACrTTATTGCATACCAGTTACCAGTAGGTATCAC 
AGGTTTGATCTTGGCAGCGATTTATGCAGCATCTCAATCAACTATTTCAACAGGTTTGAACT 
TGGACATTGGATATTCAAGATGTCATTTCTAAAAATATGTCAGACAATCGTCGTACGAAAATTGCACAATTCGTAT 

CTCTAGCAGTAGGTTTATTCTCAATTGGTGTTTCCATTGTCATGGCTCACT

TTCAATAGTTTCATGGGACTTGTACTTGGTCTACnTGGTGGTGTATTTATTCrTC 

TAAACAAGGTGCTTATGCAGCGCrGATTGTATCAACCATCGTCATGGTATTTATTAAATACTTCCTTCCTC 
GCrGTTAGCTACTGGGCATATTCATTGATTTCAATCTCrG^ 

TGGAAATAAAGTATCTGCACCTAAATATACAACGATTCATGATATTACAGAAATTAAAGCGGATTCAAGTTGGGA 
AGTTCGTCACTAA

4138.1 

ATGAAATTTAGTAAAAAATATATAGCAGCTGGATCAGCrGTTATCGTATCCITGAGTCrATGTGCCTATGCACTAA 
ACCAGCATCGTTCGCAGGAAAATAAGGACAATAATCGTGTCTCTTATGTGGATGGCAGCCAGTCAAGTCAGAAAA 

GTGAAAACTTGACACCAGACCAGGTTAGCCAGAAAGAAGGAATTCAGGCTGAGCAAATTGTAATCAAAATTACAG
ATCAGGGCTATGTAACGTCACACGGTGACCACTATCATTACTATAATGGGAAAGTTCCnTATGATGCCCTCnTAG 
TGAAGAACTCITGATGAAGGATCCAAACrATCAACITAAAGACGCTGATATTGTCAATGAAGTCAAGGGTGGTTA 
TATCATCAAGGTCGATGGAAAATATTATGTCTACCrGAAAGATGCAGCTCATGCrGATAATGTTCGAACTAAAGAT 
GAAATCAATCGTCAAAAACAAGAACATGTCAAAGATAATGAGAAGGTTAACTCTAATGTTGCTGTAGCAAGGTCT 

CAGGGACGATATACGACAAATGATGGTTATGTCTTTAATCCAGCTGATATTATCGAAGATACGGGTAATGCTTATA
ACATCTGGCTGGAAAAAATATGCAACCGAGTCAGTTAAGCTATTCTTCAACAGCTAGTGACAATAACACGCAATC
TGTAGCAAAAGGATCAACTAGCAAGCCAGCAAATAAATCTGAAAATCTCCAGAGTCTTTTGAAGGAACTCTATGA 
TTCACCTAGCGCCCAACGTTACAGTGAATCAGATGGCCTGGTCTTTGACCCTGCTAAGATTATCAGTCGTACACCA 
AATGGAGTTGCGATTCCGCATGGCGACCATTACCACTTTATTCCTTACAGCAAGCTTTCrGCOT

TTGCCAGAATGGTGCCTATCAGTGGAACTGGTTCTACAGTTTCTACAAATGCAAAACCTAATGAAGTAGTGTCTAG 
TCTAGGCAGTCTTTCAAGCAATCCTTCTTCTTTAACGACAAGTAAGGAGCTCTOT 

TTAATCCAAAAGATATCGTTGAAGAAACGGCTACAGCTTATATTGTAAGACATGGTGATCATTTCCATTACATTCC 
AAAATCAAATCAAATTGGGCAACCGACTCTTCCAAACAATAGTCTAGCAACACCTTCTCCATCTCITCCAATCAAT 
CCAGGAACTTCACATGAGAAACATGAAGAAGATGGATACGGATTTGATGCTAATCGTATTATCGCTGAAGATGAA
TCAGGTTTTGTCATGAGTCACGGAGACCACAATCATTATTTCTTCAAGAAGGACTTGACAGAAGAGCAAATTAA^ 
GTGCGCAAAAACATTTAG 

4139.1 

ATGAAAAAAAGAGCAATAGTGGCAGTCATTGTACTGCITTTGATTC

TCCAGCAGATTCCACTGGGTGAAGTGCGCTCCTGGATCCCCAATTTCGTTAGCTTGACCrACCTGCAAAATCGAGG 
TGCAGCCTTTTCTATCTTACAAGATCAGCAGCTGTTATTCGCTGTCATTACTCTGGTTGT^^ 
GGTATTTACATAAACACATGGAGGACTCATTCTGGATGGTCnTGGGTTTGACn'CT 
AAACTTTATTGAC^GGGTCAGTCAGGGCTTTGTrGTGGATATGTTCCACCTTGACTl^ 

ATGTGG C AG AT AG CT ATCTG ACG GTTG G AGTG ATT ATTTT ATTG ATTG C AATG CT AAAAG AGG AAAT AAATGG AA

ATTAA 

4139.5 

ATGAATACAAATCTTGCAAGTTTTATCGTTGGACTC^ 

ATGGTCAAACCTATGCTCTTGCTAAGGAAGAAGGCCAACATACAGTAGGGGATACGGTCAAAGGTTTTGCATACA
CGGATATGAAGCAAAAACTCCGCCTGACAACCTTAGAAGTGACTGCCACTCAGGACCAATTTGGTTGGGGACGTG 
TCACAGAGGTTCGTAAGGACTTGGGTGTCmGTGGATACAGGCCrrCCTGACAAGGAAATCGTTGTGTC^ 
TATTCTCCCTGAGCTCAAGGAACTCTGGCCTAAGAAGGGCGACCAACTCTACATCCGTCTTGAAGTGGA 
AGACCGTATCTGGGGCCTCTTGGCTTATCAAGAAGACTTCCAACGTCTTGCTCGTCCTGCCTACAACAACATGCAG 

AACCAAAACTGGCCAGCCATTGTTTACCGTCTCAAGCTGTC^^

TTGGTTTTATTCATCCTAGCGAGCGTTACGCAGAGCCACGTTTGGGGCAAGTATTAGATGCGCGCGTTATTG^ 
CCGTGAAGTGGACCGCACTCTGAACCTCTCCCTCAAACCACGCTCCTTTGAAATGTTGGAAAACGATGCT 
ATrTTGACTTATTTGGAAAGCAATGGCGGTTrCATGACCTTAAATGACAAGTCATCT^ 
CCTTrGGCATTTCrAAAGGTCAGTTCAAGAAAGCTTTAGGTGGTCTTATGAAGGCTG 

GTTTGGGACAGAGTTGATTTAG

4139.8 

ATGAAAGATGTTAGTCTATTTTT ATTG AAAAAAGTTTTCAAAAGCCGCTT AAA CTGG 
ATCTGTACTCGGTGTTACCTTTTATTTAAATAGTCAGACrGCAAACTCACACAGCTTG 
CGCATTGCAGCCAACGAGAGGGCTATCAATGAAAATGAAGAGAAACTCTCCCAAATGTCTGATACCAGCT CGGA G
GAATACCAGTTTGCTAAAAATAATTTAGACGTGCAAAAAAATCTTTTGAC^ 

TAAAAGAAGGGCGCTGGAAAGAAGCCTACTATTTGCAGTGGCAAGATGAAGAGAAGAATTATGAATTTG 
ATGACCCGACTGCTAGCCCTGGCTTAAAAATGGGGGTTGACCGCGAACGGAAGATTTACCAAGCCCTGTATCCCT 
TG AAC AT AAAAG C ACATACITTGGAGTTTCCGACCC ACG GG ATTG ATC AG ATTGTCTGG ATTTT AG AGGTTATC AT 

CCC^AGTTTGTTTGTGGTTGCTATTATTTTTATGCTAACACAACTATTTGCAGAAAGATATCAA^
ACAGCTCACnTATATCCTGTTTCAAAAGTGACATTTGCAATATCCTCTCTTGGAGTTGGAGTG 
TGCTGTTTATCGGAATCTGTGGCTTTTCTTTTCTAGTGGGAAGTCTGATAAGTGGTTTTG 
TACCCAATTTATAGCTTAGTGAATCAAGAAGTAACTATTGGGAAAATACAAGATGTATTATTTCCTGGCTTG 
TAGCTTTCTTAGCCriTATCGTCATTGTGGAAGTTGTGTACITGA 

CTCTn'CiTrCACTCATTGGGATTGTTGGCTTATTGTTTG

TCTGATTCCCTTTACITACTTGCGTTCAGTGGAGATrTTATCT^ 

TAAATTGGAGCATGGGAATGGTCTTACrrCCTTGCCTGATTATCTTTTTG 

TGGGGAAGTTCACAGAAAAAAGAATTTTTTAATAGATTCTAG 

4141.1

ATGATGAAGTTCATATTGGATATTGTTAGTACACCAGCTATTTTAGTAGCT^ 
TCAGAAGAAGAAATTACCTGATATTATTAAAGGTGGAATTAAGACCTTTGTTGGTTTCT^ 
GCAGGAATTGT AC AAAATTCTTTAAATCCATTTGGTACCATGTTTG AG CATGCTTTTC ATTTAG 
GAATAATGAAGCAATTGTAGCTGTAGCTTTAACAACATATGGCTCAGCTACrG 

GTGTTCAAT ATCTTAATCG CTCGTTTTACTCG ATTT AAAT AT ATTTTTTT AAC AGGG CACC ACA CTCT AT AT ATG G C

ATGTATGATTGCGGTC ATTTTATCAGTTGCTGG CTTT ACTAGCTTGCCTCTCATCTTACTAGG AGG ATTAGCA CTCG 
GTATTATTATGAGTATTTCCCCAGCATTTGTGCAAAAATATATGGTTCAATTAACTGGAAATGACAAGGTAGCT 
AGGTCATTTCAGTTCTTTGGGATATTGGTTGAGTGGTTTTACTGGTAGCCnTATCGGTGACA^ 
GAGGACATTAAATTTCCAAAGAGTTTAGCTTTTITAC^^ 

TTACATTATTGTAGCTATCITrGCAGGGTCAGAATATATAGAAAAAGAAATCAGTAGTGGTACAAGTGGTCTAGTT
TATGCTTTACAATTAGCAGGTCAATTTGCAGC

AATTGTTCC AGCCTTT AAAGGT ATTTC AG AG CGTCTTGT ACCTAATTC AAAACCTG CTTTGG ATTGTCCG ATTGTTT 

ATACTTATGCACCCAATGCAGTTCTAATTGGATTTATCTCTAGTTTTGTTGGTGGTTTAGTA^ 

ATTGCTTCAGGAACGGTTGTTATCnTACCAGGTGTTGTGCCTCATTTCTTCT 

GAATGCATCTGGTGGTGTTCGTGG AGCCACTATTGG AGC ATU'riU'ACAAGGTATTTTAATCAG'I'l ' 1TC 11 CCAGTCT 
TTTTAATGCCAGTTTTGGGAGGACTrGGTTTCCAAGGATCAACTTTCTCAGA 
ATTTTAGGAATGTTAAATCAATTTGGCTCACAAGCAGGCATTGTGATTGGTCTTGTTCTTATTCT 
TGGAGTATCCTTTATTAAAAAGCCATCTGCAACGGAGGAATAA 

4142.3 

ATGATTAAAACATTTCTCTCTGCCCTTTCGGTCATTCTCTTTTCTATCCCT 
TCTAATCITAACATTTGGCTATCTACCCAACCTATCTTGGCACAGATTTATGCCT^ 
GGCTGCTATTTTAAGTTTCITAlTnTnTCCT^^ 
TTTTGCTCrTACTWTCGCTCATATTACrATTATTCGGAACAGAT^ 

ACCTTAAAATTAGTAACTTGGAACGTCGCTAATCAAATAGAAGCACAACATATTGAGCGAATTTT^ 

ACGCCGATATGGCTATATTCCCTGAACTAGCTACCAATATCAGAGGTGAGCAAGAAAACCAGAGAATCAAACTAT 

TGTTTCATCAAGTTGGACTTTCTATGGCCAACTATGATATTTTCACTTCTCCACCTACCA^ 

GTGACTGTGATTGTCAAGAAAAGTTATGGTTTCTATACAGAAGCTAAAACTTTTCATACA^ 

TTGTATTACATTCGAGAAAACAAAATATACCAGATATCATTGCCTTGCATACTGCGCCTCCTCTGCCAGGTTTAAT 

GG AAATCTGG AAGCAAG ACTT A AACATC ATTC AT AATCAATTGGCTTC AAAAT ATCCA AAGG CT ATT ATTGCAG GT 

GATTTTAATGCAACTATGCGTCATGGAGCACTTGCAAAAATAAGCTCTCATAGGGACGCATTAA ATGC ACTGCCA 

CCTTTTGAAAGAGGAACTTGGAATAGCCAAAGTCCAAAACTTTTTAATGCAACAATAGATCA TA 

AAAACCACTACTATGTTAAAGATITAGACATTGTAAGTTTrCAAAACTCTGATC^ 

CACATTTTAA 

4142.4 

ATGAATCCAATCCAAAGATCTTGGGCTTATGTCAGCAGAAAGCGACTGAGAAGTTTTATTrTATTTCT 
TGGTCTTATTGGCCGGAATTTCAGCCTGTTTGACTCTC 

ATCACTCAATACATCTITITCTATTAAGAAGATAGAGAATGGTCAGACATTCAAGTTGTCAGACCTAGCATCTGTA 

AGCAAGATTAAGGGGCTGGAAAATGTCTCTCCTGAACTTGAGACGGTCGCAAAACTAAAAGACAAGGAAGCAGTG 

ACTGGCGAGCAGAGCGTGGAGCGTGATGATTTATCAGCTGCAGACAATAACTTGGTTAGCTTAACGGCTCrrGAG 

GATrCATCCAAGGATGTAACCTTTACCAGTTCGGCTTTCAATCTAAAAGAAGGGCGACACCTTCA^ 

TCCAAGAAAATCCTTATCCACGAAGAATTGGCTAAGAAGAACGGTCTTTCGCTTCATGACAAGATO 

CTGGTCAGTCTGAATCTGGAAAAGGACAAACAGTAGAGTTTGAGATTATCGGCATCTTTTCTGGTAAAAAA CAAG 

AGAAATTCACAGGCTTGTCTTCTGACTTCAGTGAAAATCAAG 

GGGCAATAGTGAAGCTCAAGTCAGTGCAGCACGCTTCTATGTAGAAAATCCTAAGGAAAT GGACG GACTCATGAA 

GCAGGTAGAAAACTTGGCCTTGGAAAATCAAGGCTACCAAGTCGAAAAGGAAA^ 

AGACTCAGTTGCAACTTTCCAAACCTTCCTGACCATOT 

ATTCTGGTTTTGTCrCTCTGGTTGAGAGAACGGGTCrATGAAGTGGGGATTTTACTTG 
GCTCGATCTTCCTACAATTCrGTTTAGAGGTAGTTTTGGTATCTCTTGGAGC^^ 

AACGCAATCACAACTTACCTACTCCAAACTCTACTAGCAAGTGGAGATCAGGCAAGCTTACAAGATACACTAGCC 

AAAGCAAGCAGTTTATCAACTAGCATCTTATCTTTTGCAGAATCCTATGTTTTTCT^ 

TGTAGCCCTTTGTTTCCTATTCTTATTTAGAAAATCACCG 

4142.5 

ATGTTACACAACGCATTTGCCTATGTTACAAGGAAGTTTITCAAATCGATTGTCATCTTCCTGATTATTCT 

GGCGAGCTTGAGTTTGGTCGGCTTGTCAATCAAGGGAGCTACTGCCAAGGCTTCTCAGGAGACCTTT 

ACCXATAGCTTCTCCATGCAAATCAATCGTCGCGTCAACCAAGGAACGCCTCGTGGTGCTGGGAATATCAAGGGT 

GAAGACATCAAAAAAATCACCGAAAACAAGGCCATTGAGTCTTATGTCAAACGTATCAACGCTATCGGAGATTTG 

ACTGGATATGACCTGATTGAAACGCCAGAAACCAAGAAGAATCTCACTGCTGATCGTGCCAAGCGTTTTGGAAGT 

AGCTTGATGATTACAGGTGTCAATGACTCCTCTAAAGAAGACAAGTTTGTCTCTGGTTCTTATAAACT 

GAGAGCACTTAACCAACGACGACAAGGATAAAATCCTCTTGCACAAGGACTTGGCAGCCAAACACGGCTGGAAA 

GTAGGGGACAAGGTTAAACTGGACTCTAATATCTACGATGCAGATAATGAAAAAGGAGCCAAGGAAACAGTTGA 

AGTGACAATCAAGGGACTCTTTGATGGTCATAATAACTCAGCAGTAACCTACTCACAAGAACmA^^ 

AGCTATT AC AG AC ATTC ACACTG CTG CAAAACTTT ATGG AT AC AC AG AAG AC AC AG CC ATTTATGGGG ACGC AAC 

CTTCTTTGTAACAGCAGACAAGAACnTGGATGATGTTATGAAAGAGTTGAATGGCATCAGTGGTATCAACTGGAA 

GAGCTACACACrCGTCAAGAGCTCCTCTAACTACCCAGCTCTTGAGCAATCTATCTCT 

AACCTCCTCTTCTGGGGTAGCTTGAGCTTCTCAGTTCT 

TCGCAAGGAAGTGGGAATTCTCCrCrCTATCGGCCTCAAGCAGGCAAGTATCTTGGGTCA 
ATTCATCACCGAATCTATCTTGATTGCTATCCCTGCrCTAGTTTCTGCTTAOT 

CAATTGGAAACACTGTCCTTGCCAATGTGACTTCAGGTGTrGCCAAACAGGCTAGTAAGGCGGCTCAAGCCT 
ACCT TGGTGGTGGTGCAGAAGTAGATGGCrTTAGCAAGACCTTGTCGAGCCTAGACATTTCCATTCAGACATCAGA 
CITTATCATCATTTITGTCCTTGCCTTGGTTCTAGTGGTTCrCGTTATG 
AACAACCAAAAGAGCTCTTGCTGGATGGTGAATAA
4144.1

ATGTCACAGGATAAACAAATGAAAGCTGm 
GGGTTGGGAGTTTGATTTTCTGTATTTGGGCTTATCAGGCT 
ATCCAGCAGGCAGGCATCTGGGGTCCACCTCTCTTrATCTTTTTACAGATTT^ 
AGGGGCCTTGACCTCGGTGGCrGGGGTCTTTATCTACXKjGCACATCATCGGGACTATCTACAACTATATCGGC^TC
GTGATTGGCTGTGCCATTATCTTTTATCTAGTGCGCCTATACGGAGCTGCCTITGT CCAGTCTGTC GTCAGCAAGC 
GCACCTACGACAAGTACATCGACTGGCTAGATAAGGGCAATCGTTTTGACCGCTTCm 

GCCCATTAGCCCAGCTGACTTTCTCTGTATGCTGGCTGCCCrGACCAAGATGAGCTTCAAG CGCTAC ATGACCATC 
ATCATTCTG ACCAAACCCTTT ACCCTCGTGGTTTATACCTACGGTCTG ACCTATATTATTG ACl" in i CTGGCAAAT 
GCTTTGA

4144.2 

ATGAGAAATATGTGGGTTGTAATCAAGGAAACCTATCTTCGACATGTCGAGTCATGGAGriU'CUH'CrriATGGTGA 

TTTCGCCGTTCCTCTTTTTAGGAATCTCTGTAGGAATTGGGCATCT 
GTGGCAGTAGTGACAACAGTGCCATCTGTAGCAGAAGGACTGAAGAATGTAAATGGTGTTAACTTCGACTATAAA
GACGAAGCAAGTGCCAAAGAAGCAATTAAAGAAGAAAAATTAAAAGGTTATTTGACCATTGATCAAGAAGATAGT 
GTTCTAAAGGGAGTTTATCATGGCGAAACATCGCTTGAAAATGGAATTAAATTTC 
AACTGCAAAATCAGCITAATCGTTCAACTGCTTCCTTGTCTCAAGAGCAGGAAA 

AATTCACAGAAAAGATTGATGAAGCCAAGGAAAATAAAAAGTTTATTCAAACAATTGCAGCAGGTGCCTT 
T CTlTU ' l 1 ATATGATTCTGATTACCTATGCGGGTGTAACAGCTCAGGAAGTTGCCAGTGAAAAAGGCACCAAAAT

TATGGAAGTCGTTTTTTCTAGCATAAGGGCAAGTCACTATTTCTATGCGCGGATGATGGCTCTG 
T AACG C AT ATTGGG ATCTATGTTGT AG GTG GTCTG G CTGCCGTTTTG CTCTTT AAAG ATTTG CC A TTCTTG G CTCAG 
TCTGGTATTTTGGATCACTTGGGAGATGCTATCT^ 

CGTAGTCTTGGCAGCCTTCCTAGGATCTATGGTTTCTCGTCCTGAGGACT^ 
ATTTTGATTATGGGTGGTTITmGGAGTGACAGCTCrAGGTGCAGCTGG

TTATATTCCCTTrATTTCGACCTTCTTTATGCCGTTTCGAACGATTAATGACT 

ATTTCACTTG CTATT AC AGTG ATTTTTG CGGTGGTAG C AAC AGG ATTT ATCGG ACG C ATGTATG CT AGTCTCGTTCT 
TCAAACGGATGATTTAGGGATTTGGAAAACCnTrAAACGTGCCTTATCTTATAAATAG 

4144.3

ATGACAGAAACCATTAAATTGATGAAGGCTCATACTTCAGTGCGC^GGTTTAAAGAGCAAGAAATTCCCCAAGTA 
G ACTT AAATG AG ATTTTG AC AGC AG CCC AG ATGG C ATC ATCTTG G AAG AATTTCC AATCCTACTCTGTG ATTGT GG 
TACGAAGTCAAGAGAAGAAAGATGCCITGTATGAATTGGTACCTCAAGAAGCCA 

TCrCrTTGTCGGAGATTTGAACCGAGCAGAAAAGGGAGCCCGACTTCATACCGACACCTTCCAACCCCAA 

GGAAGGTCTCTTGATTAGTTCGGTCXjATGCAGCTCTTGCTGGACAAAACGCOT

TATGGTGGTGTGATTATCGGTTTGGTTCGATACAAGTCTGAAGAAGTGGCAGAGCTCTTTAACCTACCTGACT 
CCTATTCnXjTCTTTGGGATGGCACTGGGTGTGCCAAATCAACATCATGATATGAAACCGAGACTGCC 
ATGTTGTCTTTGAGGAAGAATACCAAGAACAGTCAACTGAGGCAATCCAAGCTTATGACCGTGTTCAGGCTGACT 
ATGCTGGGGCGCGTGCGACCACAAGCTGGAGTCAGCGCCTAGCAGAACAGTTTGGTCAAGCTGAACCAAGCTG^ 

CTAGAAAAAATCTTGAACAGAAGAAATTATTGTAG

4146.1 

ATGTTAAAACTTATTGCTATTGTTGGAACAAATTCAAAACG 
AACACTTTACTGACAAAGCTGAAATTGAACITGTTGAAATCAAGGCC 
GCAAGTACCTGCTGAAATATTGGAAATTGCTGCTAAAATCGAAGAGGCAGATGGCGTTATTATCG GTACTC CTGA

GTATGATCACTCTATTCCAGCTGTTTTGATGAGCGCTCTTGCTrGGTTGTOT 

AACCAATCATGATTACAGGTGCTTCTTACGGTACGCTTGGTTCATCTCGTGCCCAATTGCAAOT 

GAATGCTCCTGAAATCAAGGCAAATGTTCTTCCAGATGAATTCTTGCTCTCACACTCT 

AGTGGCG ACTTGGTTG ACCTTG ATGTT ATC AAG A A ATTGG ATG CCATCTTTG ATG ACTTCCGT ATCTTTGT AAAAA 
TCACAGAAAAATTACGTAATGCACAAGAATTACTTCGCAAA^^^

4146.2 

ATGAATACCrATCAATTAAATAATGGAGTAGAAATTCCAGTATTC 

GAAGAAGCCTATCGTGCAGTGTTAGAAGCCTrGAAGGCrGGTTATCGTCATATTGATACGGCGGCGATTTATCAGA 
ATGAAGAAAGTGTTGGTCAAGCAATCAAAGATAGCGGAGTTCCACGTGAAGAAATGTTCGTAACT^

GGAATAGTCAGCAAACCTATGAGCAAACTCGTCAAGCTTTGGAAAAATCTATAGAAAAACTGGGOT 

TGGATTTGTATTTGATTCATTGGCCGAACCCAAAACCGCTCAGAGAAAATGACGCATGGAAAAC TCGCA ATGCGG 

AAGTTTGGAGAGCGATGGAAGACCrCTATCAAGAAGGGAAAA^ 

ATCATTTGGATGCCTrGCrTGAAACTGCAACrATCGTTCCrGCGGTCAATCAAGTTC 
TCAAGATCAAGTCGTAGCTTACTGTCGTGAAAAGGGAATTITATO

ACTGTTTGATAGCAAGCAAGTCCAAGAAATAGCAGCAAATCACGGAAAATCGGTTGCTCAGATAGCCTTGG CCTG 

GAGCTTGGCAGAAGGATTITTACCACTTCCAAAATCTGTCACAACCT 

GGAATTGAACTGAGTCATGAGGAGAGAGAAACCTTAAAAACGATTGCTGTTCAATCGGGTGCTCCACGAGTTGAT 
GATGTGGATTTCTAG 

4147.1

ATGAGGTGCAAAATGCTTGATCCAATTGCTATTCAACTAGGACCCCTAGCCATTCGTTGGTATGCCrTATGT ATTG 

TGACAGGCTTGATTCTTGCGGTTTATTTGACCATGAAAGAAGCACT 

AGATTTTATCTTAGTAGCCTTTCCCITGGCTATTTTAGGAGCT 

TAGTCAGAATTTAGGAGAGATITTTGCCATTTGGAATGGTGGTTTGGCCATTTA 

CTTGTGCTCTATATCTTTGCTGACCGTAAACTCATCAATACTTGGGATm 

GATTGCTCAAAGTTTGGGGCGTTGGGGTAATlTCTrrAACCAAG 

TATCTACCTGGCITTATCCGTGACCAGATGTATATTGAGGGGAGCTACCGTCAACCGACTTTCCTTTATGAGTCr 

TATGGAATCTGCTTGGCTTTGCCTTGATTCTGATTTTTAGACGGAAATGGA^ 

GGCCTTTTACTTGATTTGGTATGGTTTCGGTCGTATGGTTATCGAAGGTATGCGAACAGATAGT^ 

GCTTTCGAGTGTCCCAATGGCrGTCAGTTGTCCTTATCGGTCTCGGTATAATGATCGTTATTTATCAAAATCGA^ 

GAAGGCCCCTTACTATATTACAGAGGAGGAAAACTAA 

4147.2 

ATGGGTAAATTATCCTCAATCCTTTTAGGAACCGTTTCAGGTGCAGCTCTTG 

GCAAACAAGTTTGCAGTCAGGCTCAAGATTTTCTAGATGATTTGAGAGAAGATCCGGAGTATGCCAAGGAGCAAG 

TCTGTGAAAAACTGACAGAAGTTAAGGAGCAGGCTACAGATTTTGTTCTGAAAACAAAAGAACAGGTTGAGTCAG 

GTGAAATCACTGTGGACAGTATACTTGCTCAAACTAAATCCTATGCTTTTCAAGCGACAGAAGCATCAAAAAATC 

AATTAAATAATCTCAAGGAGCAATGGCAAGAAAAAGCCGAAGCTCrTGATGACTCAGAAGAGATTGTGATTGATA 

TAACAGAAGAATAA 

4147.3 

ATGAAAACTAAATTGATCTTTTGGGGCrCTATGCTCTTTCTCCTCTCCCT 

TTGGATTTTCTATCCTATGGAGATTCAGTGGCTAAACTTAACGAATCGAGTCTATCTAAAACCA 
TACAATTTTCATATCTTGATGAATTATCTGACCAATCCTTTTAGTCAGGTCTTACAGATG 
AGCAGCTGGTCTGCACCATTTCGCAGTGGTCAAGAATCTCTTTCATTTGGTTCAGCTAGTAGCTCT^ 
CC AAGTTTCTATGTCTTTGTCAAT AG G ATTGTG AAAAAG G ACTTTTTGTCTCTTTATOT 
AGTAGTCTTACCTGTGATGATTGGACTTGGGGGAGTTTTGATTGGTTTTGACCAATTCTT T 

TTCTCTTTGTGGG AG ATGATACCTGGC I'l 'IT 1 G ATCC AGCCAAGG ATCCTGTTATTATG ATTTTGCCAG AG ACCTTC 

TTTCTTCATGCCTTCCTCCTCTTTrTTGCCCTCTATGAAAACTTC 

GTGA 

4149.1 

ATGACTTATCATTTTACTGAAGAATACGATATTATTGTAATTGGTGCGGGACACGCTGGGGTTGAGGCTTCCTTG^ 

CCGCTAGCCGTATGGGCTGTAAGGTCCTGCTTGCGACCATCAATATTGAAATGCTGGCTTTCATGCCTTGTAATCC 

CTCTATCGGTGGTTCTGCCAAGGGGATTGTCGTGCGTGAAGTCGATGCCCTCGGTGGCGAGATGGCCAAAACCATT 

GACAAGACTTACATCCAGATGAAGATGCTAAACACAGGGAAGGGGCCAGCTGTCCGTGCCCTTCGTGCGCAGGCT 

GACAAGGAACTTTACTCTAAGGAGATGCGCAAGACGGTTGAAAACCAAGAAAATCTGACCCTTCGTCAAACCAT^ 

ATTGATGAGATTTTGGTGGAAGATGGCAAGGTTGTCGGTGTGCGTACAGCCACCCATCAAGAATATGCTGCTAAG 

GCTGTTATTGTGACGACACMjGACTGCTCTCCGTGGGGAAATTATCATCGGAGACCTCAAGTACTCATCAGGTCCTA 

ACCAT^GCTTGGCTTCTATTAACCTAGCTGACAATCTCAAGGAACTGGGTCTCGAAATCGG
AACCCCTCCACGTGTCAAGGCTTCTTCTATCAATTACGATGTGACAGAAATTCAGCCAGGAGACGAAGTGCCTAAT
CATTTCTCATACACTTCACGTGATGAGGATTATGTCAAGGACCAAGTACCATGCTGGTTGACCTATACCAATGGTA
CCAGTCATGAGATTATCCAAAACAACCTCCACCGTGCGCCTATGTTTACAGGTGTGGTCAAGGGAGTGGGGCCTC

GTTACTOTCCGTCGATTGAAGACAAGATTGTGCGC^ 

AGGGCGCAATACTGAGGAAGTCTATGTGCAAGGACTTTCAACCAGTCTGCCTGAGGATGTCCAGCGTGACTTGGT 

GCATTCCATCAAAGGTTTGGAAAATGCAGAGATGATGCGGACAGGTTATGCTATTGA 

GTATGATATGGTCnTGCCTCATCAGTTGCGTGCGACTTTGGAAACCAAGA 

CAGACAAATGGAACATCAGGTTACGAAGAGGCAGCAGGCCAAGGGATTATCGCGGGTATCAATGCGGCTCTGAA 

AATCCAAGGCAAGCCTGAATTGATTTTGAAGCGCAGTGATGGTTATATCGGGGTGATGATCGACGACTTGGTGAC 

CAAGGGAACCATTGAACCCTACCGTCTCTTGACCAGTCGTGCTGAATACCGTCTCATTCTTCGTCATGACAATGCT 

GATATGCGCTTGACTGAGATGGGACGCGAGATTGGCCTTGTGGACGATGAACGCTGGGCTCGTTTTGAAATCAAG 

AAAAATCAATTTGATAATGAGATGAAGCGCCTAGACAGTATCAAACTCAAGCCAGTCAAGGAAACCAATGCCAAG 

GTTGAGGAGATGGGCTTCAAACCCTTGACCGATGCAGTGACAGCCAAGGAATTCCTTCGCCGTCCAGAAGTTTCTT 

ACCAAGATGTGGTGGCCTTCATCGGACCAGCTGCAGAAGACTTGGATGACAAGATTATCGAATTGATTGAAACAG 

AAATCAAGTATGAAGGCTATATTTCCAAAGCCATGGACCAGGTTGCCAAGATGAAACGCATGGAAGAAAAACGCA 

TTCCGGCCAATATCGACTGGGATGACATTGATTCTATCGCAACCGAAGCCCGTCAGAAGTTCAAACTCATCAATCC 

AGAAACCATCGGCCAAGCCAGCCGTATTTCGGGAGTAAACCCAGCAGATATTTCTATTTTGATGGTGTATCTGGAA 

GGTAAAAATCGTAGTATTTCTAAAACTCTTCAAAAATCAAAATGA



4149.2 

ATGAAAGTATTAGCTTTTGATACGTCCAGCAAGGCTCTTrCTCTGGCTATTTTAGAGGATAAGCAGG 
AGACGACGATTAATATTAAGAAAAATCACAGTATTACTCTTATGCCTGCCATCGATTTTTTGATGGCAAGTTTGGA 
TTGGACACCCAAGGATTTGGACCGAATCGTGGTAGCTGAAGGGCCGGGTAGCn'ATACAGGCITGCGAATTGCGGT

AGCAACTGCTAAGACCTTAGCTCACACCCTGAACATCGAGTTGGTTGGTATGTCGAGTCTCTrGGCTCTGGTGCCC 

CATCAACAAGAAGGTTTGTTTGTCCCCTTGATGGATGCGCGTCGCAATAATGTTTATGCAGGATTT^ 

CCAAACCTGTCATGGCAGAAGCGCACCTATCTTTTGAAGAGGTGCTAGAAAAAGTCAAGGGTACTAGTCAGGTAA 

CCTTTGTCGGAGAAGTTGGCCCCTTTGTTGAGCAGATTCAAAAACACTTGCCAAGGACT^ 

GCCCAATGCAGCTAATCTAGCTCTTTTGGCCTGGGACAAGGAAGCAGACTCOT 

CTCAAACGAGTCGAGGCTGAGGAAAACTGGCTCAAGAACCATACCGAGTCTGGCGAGTCTTACATTAAACGCCTA 
TGA 

4149.3 

ATGATAGAAATCAAGCGAATTCAACAACAGCCTGACCTAGCTCAAGCCATCTACGCTGTTATGGCAGCTC 

TAGTCAGTCCXTGGACTCTGGAGCAAATCCAAGCAGATCTGTCCCAAGACCAGACTTGGTATGCATTGGCTr 

TGGGGCAGAAGTGATTGGATTTCTAGCTGTGCAGGAGAATL'riUUUGAAGCAGAAGTCCTGCAAATCGCTGTCAA 

AGGAGCTTATCAGGGTCAGGGGATTGCGTCagCCTTGTTTGCTCAATTGCCGACAGACAAGGAAATTTTCCT 

GTCAGACAGTCAAATCAACGAGCGCAAGCATTTTACAAGAAAGAAAAGATGACAGTTATCGCTGAGCGAAAGGC 

CTACTACCATGACCCAGTCGAGGACGCCATTATCATGAAGAGAGAAATAGATGAAGGATAG 

4152.2 

ATG AC AAAAC AAGTCTT ATTAGTG G ATG ATG AAG AACAC ATTCTG AAATTG CTTG ACTACC ATTT AAGT AAGG AA 

GGCM'riU'CTACTCAATTGGTGACAAATGGACGGAAGGCCTTAGCnTTGGCAGAAACAGAACCCnTr 

TGCTTGATATCATGTTACCACAATTAGATGGCATGGAAGTTTGTAAGCGGCTGAGAGCCAAAGGCGTCAAAACTC 

CAATTATGATGGTTTCTGCGAAAAGTGATGAATTTGATAAGGTTTTGGCCTTGGAATTAGGGGCTGA 

GACCAAGCCTTTTAGCCCTAGAGAATTGCTGGCGCGTGTCAAGGCTGTCCTCAGGCGAACTAAAGGAGAACAAGA 

AGGAGATGATTCAGATAATATCGCTGACGATTCTTGGCTATTTGGGACCTTGAAAGTATACCCTGAGCGTCATG^ 

GTCTACAAGGCGAATAAGTTACTGAGTTTGACCCCAAAAGAATTrGAAAG 

TCAAAGTTTCGAAAGTAACCGCCCAATAA 

4154.1 

ATGACTACTTTTAAAGATGGATTTTTATGGGGTGGTGCTGTTGCTGCTCATCAACrTGAAGGTG 
GTCGCAAGGGAATTAGTGTTGCTGATGTTATGACTGCTGGTCGTCATGGAG 

TTT AG AGGGT AAATATT ATCC AAATC ATG AG G CG AT AG ATTTTT ATC ACCGTT AT AAAG AAG ATAT AG C ACTTTTT 

GCTGAAATGGGATTCAAGTGCTTCCGTACCTCTATTGCATGGACACGTATCTTTCCAAAAGGTGATGAGTTAGAGC 

CGAATGAAGAAGGATTACAGTTTTATGATAATCTTTTTGATGAATGCTTAAAGAATGGTATTGAACCTCT 

TCT ATCTC ATTTTG AAATG CCTT ATC ACTT AGTG ACCG AAT ATGGTGGTTGG AAAAAT AG G AAATTG ATTGATTTC 

TTTGCTCGTTITGCAGAAGTCGTATTTAAACGTTACAAAGATAAGGTTAAATATTGGATGAC^ 

AT AATC AAG CGAATT ATC AGG AAG ATTTTG CACC ATTT ACT AACTCAGGTATTGTATATG AG GAAGGTGATAATAG 

AGAAGCAATTATGTATCAAGCAGCACATTACGAATTAGTTGCTTCTGCACGAGCTGTAAAAATTGGTCATGAGATT 

AATCC AG ATTTTCAAAT AG GTTGT ATG ATTG CG ATGTGTCC AATTTATCCAGTT ACTTGC AATCCT AAGG AT ATCTT 

AATGGCAATGAAAGCTATGCAGAAGCGTTATTATTTTGCTGATGTGCATGTTTTAGCTAAATATCCT 

TTCAAGTATTGGGAACGAAAAGGTATTTCAGTTGATTTTACTGCCCAGGATAAAGAAGATT^ 

TAGATTACATTGGTTTCAGTTACTATATCTCCTTTGCTA 

CTTGAAACAGAAGATTTAGTGAAAAATAATTATGTTAAGGCTTCTGAATGGGAGTGGCAAATTG 
TTGCGTTATGCGTTAAATTGGTTTACAGACCACTATCACTTACCACrCTTTATTC 

AGATCAAGTTGCAGCAGATGGTATGGTACATGATGATT AT AGAATTG AAT ATCTAGGTGCCCATATTCGTG AAATG 
AAAAAGGCTGTAGTTGAAGATGGTGTTGATTTAATGGGTTATACTCCATGGGGATGTATTG 
GTACCGGTGAAATGCGGAAACGTTATGGCTTTATTTATGTAGATAAAGATGATAATGGGAAGGGAAGTTATAATC 
GTTCCCCGAAAAAATCTTTTGGCTGGTATAAGGAAGTTATTTCATCTAACGGTGAATCAGTAGAATAG 

4154.2 

ATGGATCAACAAAACGGGTTGTTTGG 1"I TIC l"l G AAAACCATGTTATGGG ACCAATGGGC AAACTTGCTCAGTTTA 

AAGTAGTACGTGCTATCACGGCTGCAGGTATGGCTGCTGTACCATITACTATTGTAGGATC 

CAGTATTTTGCCACAAGCTTTCTCATTTTGGCCAATTGTGGCAGATATT^ 

C ACTTTAC ATGGTTG C AAACT ATG CG ACT ATGGGTTCTCTATCTCTTTATTTCGTTCT ATC ACTTGC ATATGAATTG 

ACAAAAATTTATGCAGAGGAAGA^AACTCAATATGAATCCTCTTAATGGTGCCrTGCTTGCCTTGA 

TCATGACAGTACCGCAAATCATTTTTGATGGTGGAATGATGAAGACTGTGACAAGTCTAAAAGAAGGTGCAGTAA 

TTGCAGATGGATGGGCAATGGGAAATGTACTCGCACGTTTTGGGACAACAGGGATTTTTACCGCAATCATTATGG 

CAATTGTGACTGTTCTTATTTATCGTATGTGTGTTAAACATAATTGGGTTATTAAAATGCCTGAA 

GGAGTTTCTCGTGGATTTACCGCTTTGGTTCCGGGATTTGTTGT^ 

GTAGCAATGGGAACAGATATTTTTAAAGTCATTGCAATTCCATTTGGTTTTGTATCCAATCT 

TTGGTTTAATGATTATTTATCTATTGACTCAACTACTTTGGATTGTAGGTATCCACGGTGC^ 

TTTGTTAGTCCAATTGCTCTTGCTAACATGGCTGAAAATGCTGCTGGCGGGCACTTCGCTGTTGCA 

CTAAT ATGTTTGTAATTGCAGGTGGTTCTG GTG CAACTTTAGG ACT ATGTTT ATAT ATTGCTTTTGCCTCTAAATCT 

GAACAGCTTAAAGCAATAGGACGAGCATCTGTAGTTCCAGCCTTATTTAATATTAATGAACCATTAATTIT^ 

TACCTATTATCTATAATCCAGCCTTGGCTATACCATTTATTTTAGCACCAATGGTTACTGCTACTAm 

GCGAATTCTCTAAACTTTATTAAGCCAATTATCGCACAGGTTCCATGGCCAACTCCAGTAGGGATTGGAGCTT^ 
TAGGGACAGCAGATCTTCGAGCTGTATTAGTTGCTCTAGTATC^
ATCCGTGTATATGATCAAAAATTGGTGAAAGAAGAGCAAGGTATCTAA 

4155.1 

ATGAAAAAATTTTATGTAAGTCCAATTTTTCCTATTCTAGTAGG
TATTTTTGTTAATAATAATCTGTTGACGGTTTTMTTTTGm 

ACTGAGAGTGCATTATACAAGGAGTGATGTAGAACAGATACAGTATGTAAACCACCAAGCGGAAGAAAGTTTGAC 

AGCTCTATTGGAACAGATGCCTGTAGGTGTTATGAAATTGAATT^^ 

TATGCTGAATTGATTTTGACCAAGGAAGATGGTGATTTTGATTTAGAAGCTGTTC^ 

TAGGAAATCCGTCTACTTATGCCAAGCTTGGTGAGAAGCGTTATG

GTATTITGTAGATGTATCCAGGGAACAAGCCATAAC^GATGAATTGGTAACAAGTAGACCAGTG ATTGGG ATTGT 
CrCTGTGGATAATTATGATGATTTGGAGGATGAAACTTCTGAGTCAGATATTAGTCAAATCA ATAut 1 1 iGTAGCT 
AATTTTATATCAGAGTTTTCAGAAAAACACATGATGTTTTCTCGTCGGGTAAGTATGGATCGATTTTATCT 
TGACTACACGGTGCTTGAGGGCTTGATGAATGATAAATTTTCTGTTATTGATGCTTTCAGAGAAGAGT 

AGACAGTTGCCCTTGACCTTAAGTATGGGATTTTCTTATGGCGATGGAAATCATGATGAGATAGGGAAAGT^

TGCTCAATXTGAACTTGGCTGAAGTACGTGGTGGCGACCAGGTGGTTGTTAAGGAAAACGACGAAACGAAAAATC 
CAGTTTATTTTGGTGGTGGGTCTGCTGCTTCAATCAAGCGTACACGGACTCGTACGCGCGCTATG ATGA CAGCTA 
TTCAGATAAGATTCGGAGTGTAGATCAGGTTTTTGTAGTCGGTCACAAAAATTTAGACATGGATGCTTTGG 
GCTGTAGGTATGCAGTTGTTCGCCAGCAATGTGATTGAAAATAGCTATGCTCTITATGATGAAGAACAAATGTCTC 

CAGATATTGAACGAGCTGTTTCATTCATAGAAAAAGAAGGAGTTACGAAGTTGTTGTCTGT^
GGATGGTGACCAATCGTTCmGTTGATTCTTGTAGACCATTCAAAGACAGCCTTAACATTAT 
TGATTTATTTACCCAAACCATTGTTATTGACCACCATAGAAGGGATCAGGATTTTCCAGATAATGCGGTTATTACT 
TATATCGAAAGTGGTGCAAGTAGTGCCAGTGAGTTGGTAACGGAATTGATTCAGTTCCAGAATTCTAAGAAAAAT 
CGTTTGAGTCGTATGCAAGCAAGTGTCTTGATGGCTGGTATGATGTTGGATACTAAAAATTTCACCTCGCGAGTAA 

CTAGTCGGACATTTGATGTTGCTAGCTATCTCAGAACGCGCGGAAGTGATAGTATTGCTATCCAGGAAATCGCTGC
GACAGATTTTGAAGAATATCGTGAGGTCAATGAACTTATTTTACAGGGGCGTAAATTAGGTTCAGATGTACT 
GCAGAGGCTAAGGACATGAAATGCTATGATACAGTTGTTATTAGTAAGGCAGCAGATGCCATGTTAGCCATGTCA 
GGTATTGAAGCGAGTTTTGTTCTTGCGAAGAATACACAAGGATTTATCTCTATCTCAGCTCGAAGTOT 
TGAATGTACAACGGATTATGGAAGAGTTAGGCGGTGGAGGCCACTTTAATTTGGCAGCAGCTCAAATTAAAGATG 

TAACCTTGTCAGAAGCAGGTGAAAAACTGACAGAAATTGTATTAAATGAAATGAAGGAAAAGGAGAAAGAAG^
TGA 

4156.1 

ATGAAAGAGAAAAATATGTGGAAAGAATTGTTGAATCGTGCAGGCTGGATTTTC 
TATATCAGGTTCCCCTAGTGGTTACCTCTATTTTGACTTTAAAAGAAGTAGCCCTGCTACAGTCA GGGCT G ATAGT
TGCTGGCCTTTCAATTGTGGTTCTGGCTCTATTTATTATGGGAGCTCGTAAAACCAAGTTAGCT 
CnTTTTTTAGAGCTAAAGATTTGGCACGTTTGGGCTTGAGTTATCTAGTTATTGTCGGOT 

ATTTTATTGCAACTGTCAAATGAGACGACAACAGCTAACCAGTCTCAGATTAATGATATGGTTCAAAATAGTTCGT 
TGATTTCCAGTTTCTTCTTGCTAGCCTrGCTTGCTCCGATTTGTGAGGAAATCITGTGTOT 

AAGATTTTCCGAGGCAAGGAGAACTTGGGATTTGTAGTCGGTACGATTGTGTTTGCTT^

ATTTACCTrCTTTATTGATTTATGGAGGTATGTCGACAGTTCTATCTTGGACAGCCTACAAGACCCAA 

AATGTCGATCTTGCTTCACATGATTGTTAATGGGATTGCTTTCTGTTTGTTGGCTCTTGTGGTGATTATG AG 

C ATT AG G AATTTCTGTTTAAATG AAAG AG AAAAAT ATGTGG A AAG AATTGTTG AATCGTGC AG GCTG G ATTTTGGT 

CTITrTACTTGCCGTCCTTTTATATC^ 

TACAGTCAGGGCTGATAGTTGCTGGCCTTTCAATTGTGGTTCTGGCTCTATTTATTATGGGAGCTC

GTTAGCTAGTTTTAA' 1 ' l ' n ' l ' Cl 1 1 r TTTAGAGCTAAAGATTTGGCACGTTTGGGCTTGAGTTATCTAGTTATTGTCG 
GGTCAAATATACTTGGTTCCATTTTATTGCAACTGTCAAATGAGACGACAACAGCTAACCAGTCTCAGATTAATGA 
TATGGTTCAAAATAGTTCGTTGATrTCCAGTTTCTTCITGCTAGCCTTGCT^ 

GTCGTGGGATTGTrCCTAAAAAGATTTTCCGAGGCAAGGAGAACTTGGGATTTGTAGTCGGTACGATTGTG^ 
TTTATTGCATCAACCAAGTAArrTACCTTCTTTATTGATTTATGGAGGTATGT^^

ACAAGACCCAACGTTTGGAAATGTCGATCTTGCTrCACATGATTGTTAATGGGATTGCTTTCT 
GTGGTGATTATGAGTCGGACATTAGGAATTTCTGTTTAA 

4156.4 

ATGGATACACAAAAGATTGAAGCGGCTGTAAAAATGATTATCGAGGCTGTAGGAGAGGACGCTAATCGCGAGGGC
TTGCAGGAAACACCTGCTCGTGTAGCCCGTATGTATCAAGAGATTTTTTCAGGTCTTGGTCAAACAGCAGAGGAAC 
ATrTGTCAAAATCCTTTGAAATTATTGACGATAATATGGTGGTAGAAAAGGATATCTTTTTCCATACCA 
ACACCACTTCTTGCCATTTTATGGTAGAGCGCACATTGCCTACATTCCAGATGGTCGTGTGGCAGGCTTGTCTAAG 
CT AG CCCGT ACG GTTG AAGTTTATTCG AAAAA ACC AC A A ATTC AAG AACGTTTG AAT ATCG AAGTG GCCG ATG CC 

TTGATGGACTATCTAGGTGCTAAAGGAGCCTTrGTTGTCATTGAGGCGGAACATATGTGTATGAGTATGCGTGGTG
TTAGAAAACCAGGCACTGCAACCTTGACGACAGTAGCTCGTGGTCTATTTGAAACAGATAAGGATCTCCGTGACC 
AAGCTTATCGTTTAATGGGGCTATAA

4157.2 
ATGAAAGACTTGTTTTTAAAGAGAAAGCAGGCCTTTCGTAAGGAGTGTCTTGGTTATCTGCGCTA

ACCACTTTGTCTTGTTCCTGCHTGTCCTGTTGGGCT 

GAAAATCATTGGCCTATCCTTTTGTTTGTAGGAATTACGTCTGTTrr^ 

TATGGAGGCTCCAGACAAGCTCTTTCTCTTAGTTGGAGAAGAGGAAATTAAGCTCCATCTCAAGCGTC 
CATTTCCCTAGTCTTTTGGCTCTTTGTACAGACCCITrrCTTGCT

TGGCTTGCCAGTTTTTCTGCTCTATGTGCTTTTATTGGGGGTAGGAAAATATT^ 
AATTTTTCACTGAAACTGGACTGGACTGGGACTATGTTATTTCTCAAGAAAGCAAGCGTA^ 
TTTCTTTGCCCTCTTTACGCAGGTCAAGGGAATTTCAAACAGCGTTAAGCGTCGTGCCTATCT 
AGGCTGTTCAGAAGGTGCCTGGGAAGATTTGGCAAAATCTCT^ 
TGCTCTCAGTCTTCGTCTTCTCTTGCTTTCCOT

TGGTAGTTCTCTTTAACTACCTCTTGCTCTTCCAGTTGCTGGCCCTCTATCATGCCTT^ 
CAACTCTTTCCGCTGGACAAGGGGCAAAAGGAAAAAGGCTTACAGGAGGTAGTTCGAGGATTGACCAGTTTT 
TTACTTGTGGAATTAGTTGTTGGGTTGATTACCTTCCAAGAAAAACTAGCCCTTCTAGCCTTA 
TGGTTTTACTAGTCTTGTATTTGCCTTATCAGGTAAAACGTCAGATGCAGGACTAA 


4158.1

ATGAGAAAATCAATAGTATTAGCGGCAGATAATGCCTATCTTATTCCTTTAGAGACGACTATAAAGTCTC 

ATCACAATAGAGATGTTGATTTTTATATTCTCAACAGTGATATAGCTCCTGAATGGTTT 

AATGGAAGTTGTGAATTCTACAATTCGCAGTGTACACATTGATAAAGAACnTTTTGAAAGCTATAAA^ 

CATATAAATTATGCrTCrTACTTTAGATTTTTTGCGACAGAAGTGGTTGAATCTGAT

CGATATCATTGTAACTGGGGAACTAGCTACTTTGTTTGAGATAGATCTCAAAGGATATTCAAT^ 
GATGTCTATGCCTATGAAGGACGAAAATCTGGATITAATACTGGTATGTTACTAATGGATGTTGCAAAGTGGAAAG 
AACATTCTATTGTCAATAGTTTATTGGAATTAGCGGCCGAGCAGAATCAAGTTGTTCATCTTGGGGATCAGAGTAT 
TTTAAATATTTATTTTGAGGATAATTGGCTAGCCTTAGATAAAACA 

ACCTTGCTCAAGAATGTGAACGTCTAGATGACAATCCACCTACAATTGTTCACTATGCTAGTCATGATAAA CCTTG
GAATACATATAGTATATCTAGACTACGTGAATTATGGTGGGTTTATAGAGATTTGGATTGGTCAGAGATTGCTTTT 
CAACGTTCCGATTTAAArrATTTTGAAAGAAGCAATCAGTCTAAAAAACAAGTGATGCTTGTGACATGGA 
GATATAAAACATTTAGAGTATTTAGTACAACGGTTACCTGATTGGCATTTTCATTTGGCTGCACCGTGTGATT 
TGAGGAGCTGACCTCTCTATCACAGTATACGAATGTAACAGTATATCAAAATGTATTACATAGTAGAATTGATTGG 

CTATTGGACGATTCTATAGTTTATTTAGATATTAATACAGGTGGAGAGGTTTTTAATGTAGTTACAAG

AAAGTGGCAAGAAAATOTCGCTTITGATATCACACGTAAAAGTATGGATGATGGACTCTATGACGGTAI milC 
TGTGGAGAGACCAGATGATTTAGTGGATAGAATGAAGAATATAGAGATAGAGTAA 

4158.2 

ATG ACT AAG ATTT ATTCGTC AAT AG C AGT AAAAAAAG G ACT ATTT ACCTCATTTCTACTGTTT ATCTATGT ATTGG

GAAGTCGTATTATTCTCCCTTTTGTTGACCTAAATACTAAAGATTTTTTAGGAGGTTC 
TCAGCCGCCCTAACAGGTGGGAATCTAAGAAGTTTATCAATTTTTTCTGTTGGATTATCCCCTTG 
TG ATTTT ATGGC AG ATGTTTTCTTTTTCTAAACGGTTGGGTTT AAC ATCT ACGTCT AT AG AAAT AC AAG ATCGCCGT 
AAAATGT ACCTG ACCTTG CTAATTGCTGTG ATTC AATCCTTGG CAGTT AG CTTG AG ACTG CC AGT ACAATCCTCCT 

ATTCTGCAATATTGGrrGTTCTAATGAATACAATATTGCTGATAGCAGGAACATTTTT^

TTAAATGCGAGTATGGGGATTGGAGGTrCTATTGTAATCCTCCTATCCAGTATGGTTTTAAATATTCCrC AGGAT G 
TTTTGGAAACATTTCAGACAGTACACATTCCAACAGG 
TATTTACTTGCCCnTATGTATCGAGCTCGCTATTTGGTTCCTGTTAATAA 
CTATTCTTATCTCGAAATC^TGTTGAATCCT^^ 

CAGCTTATTTGTTCATCTTGTTGGGATTTATTTTCCCTAATCATTCAGGGTTAGC

GTTGGAAAGCCTTTGTGGGTCTATGTTTATATTTCGGTCTTATTTTTATTTAGTATCAT^ 

AATGGAGAAGAGATTGCAGACCGTATGAAAAAATCTGGAGAATACATTTATGGTATTTATCCAGGTGCGGATACT 
AGTCGATTTATTAATCGATTGGTCCTTCGTTTCrCAGTCATAGGTGGTCTCTTTAATGTGATTATGGCAG 
CATGCTTTTTGTTTTGTTTGATGAAAAGTTATTACGATTGGCAATGATTC^ 
TGATTTTTACGATTAGAGACGAGGTCAAGGCITrAAGGCTAAATGAGACCTATAGACCTTTGATTTA^

4158.3 

ATGTCCTCTCTTTCGGATCAAGAATTAGTAGCTAAAACAGTAGAGTTTCGTCAGCGTCTTTCCGAGGGAGAAAGTC 
TAGACGATATTTTGGTTGAAGCTTTTGCTGTGGTGCGTGAAGC^^ 

TGTTCAAGTCATGGGAGCTATTGTCATGCACTATGGAAATGTTGCTGAGATGAATACGGGGGAAGGTAAGACCTT
GACAGCTACCATGCCTGTCTATTTGAACGCTTTTTCAGGAGAAGGAGTGATGGTTGTGACT 
TCAAAGCGTGATGCCGAGGAAATGGGTCAAGTTTATCXnTrTCTAGGATTGACCATTGGTG 
ATCC\AAGAAGGAGATGAAAGCTGAAGAAAAGAAGCTTATCTATGCTTCGGATATCATCTACACAACCAATACT 
ATTTAGGTTTTGATTATCTAAATGATAACCTAGCCTCGAATGAAGAAGGTAAGTITr^ 

OATTATTGATGAAATTGATGATATCTTGCTTGATAGTGCACA

AGTCTAATTACTATGCGATCATTGATACACrTGTAACAACCTTGGTCGAAGGAGAGGATTATATCm 
GAAAGAGGAGGTTTGGCTCACTACTAAGGGGGCCAAGTCTGCTGAGAATTTCCTAGGGATTGATAATTT 
GGAAGAGCATGCGTCrri^'GCrCGTCATTTGGTTTATGCGATTCGAGCrCATAAGCTCTTTACT 
TATATCATTCGTGGAAATGAGATGGTACTGGTTGATAAGGGAACAGGGCGTCTAATGGAAATGACTAAACTTCAA 

GGAGGTCTCCATCAGGCTATTGAAGCCAAGGAACATGTCAAATTATCTCCTGAGACGCGGGCTATGGCCTCGATC
ACCTATCAGAGTCnTTTTAAGATGTTTAATAAGATATCTGGTATGACAGGGACAGGTAAGGTCGCGGAAAAAGAG
TTTATTGAAACTTACAATATGTCTGTAGTACGCATTCCAACCAATCGTCCGAGACAACGGA 
TTGACTATCCAGATAATCTATATATCACTTTACCTGAAAAAGTGTATGCATCCTTGGAGTACATCAAGCAATACCA 
TGCTAAGGGAAATCCTTTACTCGTTTTTGTAGGCTCAGTTGAAATGTCTCAACTCT^ 
AAGGGATTGCCCATAATGTCCTAAATGCTAATAATGCGGCGCGTGAGGCTCAGATTATCTCCGAGTCAGGTCAGA
TGGGGGCTGTGACAGTGGCTACCTCTATGGCAGGACGTGGTACGGATATCAAGCTTGGTAAAGGAGTCGCAGAGC 
TTGGGGGCTTGATTGTTATTGGGACTGAGCGGATGGAAAGTCAGCGGATCGACCTACAAATTCGTGGCCGTTCTGG 
TCGTCAGGGAGATCCTGGTATGAGTAAATTTTTTGTATCCTTAGAGGATGATGTTATCAAGAAATTTGGTCCATCT 
TGGGTGCATAAAAAGTACAAAGACTATCAGGTTCAAGATATGACTCAACCGGAAGTATTGAAAGGTCGTAAATAC 

CGGAAACTAGTCGAAAAGGCTCAGCATGCCAGTGATAGTGCTGGACGTTCAGCACGTCGTCAGACTCTGGAGTAT
GCTGAAAGTATGAATATACAACGGGATATAGTCTATAAAGAGAGAAATCGTCTAATAGATGGTTCTCGTGACTTA 
GAGGATGTTGTTGTGGATATCATTGAGAGATATACAGAAGAGGTAGCGGCTGATCACTATGCTAGTCGTGAATTAT 
TGTTTCACTTTATTGTGACCAATATTAGTTTTCATGTTAAAGAGGTTCCAGATTATATAGATGTAA 
GCAGTTCGTAGCnTrATGAAGCAGGTGATTGATAAAGAACTrrCTGAAAAGAAAGAATTACT^ 

TATATGAACAGTTTTTACX}ACTTTCACTGCITAAAGCCATTGATGACA^

ACAGCTATCCATGGCTATCGGTGGTCAATCTGCTAGTCAGAAAAATCCAATCGTAGAGTACTATCAAGAAGCCTA 
CGCGGGCTTTGAAGCTATGAAAGAACAGATTCATGCGGATATGGTGCGTAATCTCCTGATGGGGCTGGTTGAGGT 
CACTCCAAAAGGTGAAATCGTGACTCATTTTCCATAA

4158.4

ATGATAGGGACTTTCGCCGCTGCTCTTGTAGCTGTACTAGCAAATTTCATCGTCCCTATTGAAATTACCCCAAATA 
GTGCCAATACTGAAATTGCACCACCAGATGGGATTGGGCAGGTTCTCAGCAACCTCTTGCTCAAACTGGTTGACA 
ACCCAGTCAACGCCCTGCTTACTGCTAACTATATTAGAATCTTATCTTGGGCAGTCATTm 
AGAAGCCAGTAAAAATAGTCAAGAATTGCTAAAAACTATCGCTGACGTGACTTCTAAAATTGTCGAATGGATCA 

CAATCTGGCTCCATTTGGAATCCTTGGTCTTGTTT^

TACGGTATTTTATTGGTTCTATTAGTAACGACTATGCTTTTTGTTGCCCCTGTGGTG 
TTTATGAGACGCAATCCTTACCCTCTAGTTTGGAACTGCCTCCGTGTCAGCGGTGTGACAGCCITm 
GTTCTGCGACTAACATTCCTGTCAACATGAAACTCTGCCATGACCTTGGACTCAACCCAGATACCTATTCTG 
TATCCCACTCGGTTCTACTATCAATATGGCTGGAGTAGCGATTACCATTAACCTTTTGACCCTTGCT^ 

ACTCTTGGAATTCCTGTTGACTTTGCCACAGCCTTTGTCCTCAGTGTGGTAGCAGCTATCTCATCCTGTGAT^
AGGTATTGCCGGAGGTTCCCTCCTTCITATCCCAGTTGCTTGTAGCCTTTTCGGTATTTCT 
AAATTGTTGGGGTTGGTTTTGTGATTGGTGTCATCCAAGACTCATGTGAAACAGCCCTTAACT 
CCTCTTTACCGCCGTTGCCGAATACGCAGCAACCCGTAAAAAATAA 

4158.5

ATGTCTATTAGCCAACGTACGACCAAGCTCATCTTAGCTACCTGTCTTGCCTGCCTGCTTGC TTATT TTCT 

TTCGTCAGCAGTTTCGGCTGGAATTATCGCTCTCTTGAGCCrATCTGATACGCGTAGAAGTACT^ 

CGCAATCGTCTTTTTTCTATGCITCrAGCrCTGGCTATCGGTGTTCT 

GAGTCTCGGCCTCTATCTGGCCTTCTACGTTCCTTTAGCCTACAAGATGGGCTGGGAAATTGGCATCACA C^ 
ACTGTTITGGTTAGCCATCTCTrGGTTCAAGAGTCAACCTCTCCAGACCTTCTAGTCAATG

TATTGGTACAGGATTTGCCTTGCTTGTTAATCTCTATATGCCTTCACGAGAAGAGGAAATCCAGCACTACCACACG 
CTGGTGGAAGAAAAGTTAAAAGATATCCTCCAGCGCTTCAAATACTATTTATCCAGAGGAGACGGACGCAACCGA 
GCACAGCTGGTAGCAGAATTAGACACGCTITTGAAAGAAGCCCTCAGACTGGTCTATTTGGATCACTCrGACCACC 
TCTTTCACCAGACAGACTACCATATCCACTACTTTGAGATGAGACAGCGACAAAGTCG TATCC TGAGAAACATGG 
CCCAACAGATTAACACTTGTCACCTTGCCGCCAGTGAAAGCCTGATCTTAGCGCAACTCTTTTC

TCAACTGAGCCAGACCAATCCTGCTTCTGATTTGCTAGATGAAATTGAACGTTATCTGGAAGTCTTCCGGAACCGC 

AGTCTGCCCAAGACAAGAGAAGAATTTGAAACCCGCGCCACCCTTCTTC 

ATCCAAGTAAAAGTTGATTTTTACCAAAAATATAGACAGTAA 

4158.6

ATGGAAATCATGTCGCTTGCGATTGCTGTTTTTGCCGTCATCATTGGTTTAGTCATTGGATATGTCAGCATCT 
TAAGATGAAATCATCTCAGGAAGCTGCAGAGTTGATGCTTTTAAATGCTGAACAAGAAGCAACTAATTTACGTC 
ACAAGCTGAGCGTGAAGCGGATTTACTTGTTAATGAAGCCAAACGTGAAAGCAAGTCTCTTAAAAAAGAAGCACT 
ATTGGAGGCCAAAGAAGAAGCCAGAAAATACCGTGAAGAAGTGGACGCTGAATTCAAATCAGAACGTCAAGAAC 
TCAAACAAATCGAAAGTCGTTTGACAGAGAGAGCTACrAGCCTTGACCGTAAGGACGACAATTTGACGAGTAAAG
AACAAACACTTGAACAAAAAGAACAAAGTATTTCTGATAGAGCGAAAAACCT^ 

AGGAAGTCGAAAGACAAAAAGAAGCAGAACTAGAGCGTATTGGTGCGCTGTCTCAGGCAGAAGCACGAGATATT 
ATCTTGGCTCAGACAGAGGAAAACTTGACCAGGGAGATTGCCAGTCGCATTCGCGAAGCTGAGCAAGAGGTCAAG 
GAACGTTCTGACAAAATGGCCAAGGACATCTTGGTTCAAGCTATGCAACGTATCGCTGGTGAATATGTAGCGGAG 

TC AA C A A ACTC A A C AGTTC AT CTG CG AG ACG AT ACT ATG AAGGG ACG C ATT ATTG GTCGTG AAGGTCGT AAC ATT

CGTACCTTTGAAAGTTTGACAGGGGTCGATGTGATTATCGACGATACACCAGAAGTGGTGACCTTGTCAGGATTTG 
ATCCGATTCGTCGTGAGATTGCCCGTATGACTATGGAAATGTTGCTCAAAGATGGTCGTATACATCCAGCTCGTAT 
CG AAG AGTTGGTTG AG A AAAACCGTC AAG AG ATTG A C AAT AAG ATTCGTG A AT ACGGTG AG GCTG CTGCCT ATG A 
AATTGGTGCGCCAAACCTTCATCCAGACTTGATGAAGATTATGGGACGTTTGCAGTTCCGTACrTCATATGGACAA 

AATGTTTTGCGCCATTCGATTGAGGTTGCTAAGTTGGCTGGTATCATGGCGAGCGAACITGGTGAAAATGCGGCrC
TTGCCCGTCGTGCTGGATTCCITCACGATATCGGGAAAGCCATTGACCATGAGGTTGAAGGTAGCCACGTTGAAAT
CGGTATGGAATTGGCCCGTAAGTACAAGGAACCCCCAGTTGTGGTGAATACGATTGCTAGTCACCACGGAGATGT 
TGAAGCTGAGAGCGTGATAGCAGTTATCGTCGCTGCAGCAGATGCCTTGAGCGCAG CCCGT CCAGGTGCTCGTAG 
TGAGTCTCTTGAAAGCTACATCAAGCGTCTCCATGATTTGGAAGAAATTGCTAACGGCTITGAAGGAGTGCAAACT 
AGCTTTGCCCTTCAAGCAGGACGTGAAATTCGTATCATGGTCAATCCAGGAAAAATCAAGGACGACAAAGTCACA
ATCTTGGCTCACAAAGTTCGTAAGAAAATTGAAAACAATCTCGATTATCCAGGAAATATCAAGGTAACCGTGATT 
CGCG AGCTTCGTGCAGTAG ATTATG CTAAATAA 

4158.7 

ATGATGTTAAAACCCTCTATTGATACCnTGCTCGACAAGGTTCCTTCAAAATATTCACTCGTAATCTT

AACGTGCCCACGAATTGGAAGCAGGTGCCCCAGCAACTCAAGGTTTCAAGTCTGAAAAATCAACTCTrCGCGCTT 
TAGAAGAAATCGAATCAGGAAACGTTACAATTCACCCAGATCCAGAAGGAAAACGTGAAGCAGTGCGTCGCCGTA 
TCG AAG AAG AAAAACG CCG CAAAG AAG A AG AAG AAAAG AAAATC AAAG AG CAAATTGCT AAAG AAAAAG AAG A 
TGGTGAAAAAATTTAA 

4161.1 

ATGTCAGCATATCAATTACCGACCGTATGGCAGGATGAAGCTAGTAATCAAGGAGCT TITAC GGGGCTAAACAGA 
CCAACAGCAGGTGCCCGTTTCGAACAAAACTTGCCAAAAGGAGAACAAGCTTTTCAGCTTTATTCACTGGGAACA 
CCAAATGGTGTGAAGGTTACTATCTTATTGGAAGAArrACTAGAAGCrGGTTTTAAGGAAGCGGOT 

ATAAGATTGCTATCATGGATGGGGATCAATTCGGATCAGACTTTGTGAAGCTCAATC CAAAT TCCAAGATTCCAGC
CTTATTGG ACCAGTCAGGTACTGAAAACGTAAG AGTCTTTG AGTCTGCTCATATTCl'l'Crr 1 ACCTTGCTG AG AAA 
TTTGGAGCCTTTTTACCAAGTAATCCTGTGGAAAAGGTAGAAGTTTTGAATTGGCT 
CACCTTTTCTAGGTGGGGGATTTGGACATTTCTTCAATTATGCTCCTGAAAAATTGGAATATCCT 
ACGATGGAAGTGAAACGCCAGTTGGATTTATTGGATAAGGAATTGGCTCAGAAACCnTATATTGCAGGCAATGAC 

TATACGATTGCAGATATTGCTATCTGGTCTTGGTATGGACAGTTAGTTCAAGGAAATCTTTACCAAGGTTCTGCAA
AATTCTTGGATGCCTCAAGTTATCAAAATCTAGTAAAATGGGCAGAAAAAATTGCCAATCGTCCAGCTGTTAAGC 
GTGGCTTGGAAGTAACTTATACAGAAATTAAATAG 

4161.2 

TTGGCAAGCTTGATCACTTCTATCATCATGTTCTATGTCGGTTTCGATGTTCTAAGAGAT^

CAGTCGGGAAGAAACGGTCATTGATCCTCTTGGTGCAACTCTAGGAATCATTTCTGCAGCGATTATGTT TGTGG TC 
TATCTCTACAATACTCGCCTCAGTAAGAAATCCAACTCCAATGCGCTGAAGGCAGCTGCTAAGGACAATCTTTCT 
ACGCTGTTACCTCACTTGGAACCGCCATTGCCATCCTAGCTAGTAGTTTCAATTATCCG ATTGT GG ATAA ACTGCT 
TGCTATCATCATCACrTTCTTTATCITGAAGACTGCCTATG 

GCTTTGACGACCGCCTGCTCGAGGACTACCAAAAGGCTATCATGGAAATTCCCAAAATCAGCAAGGTC AAATCG C
AAAGAGGTCGCACCTACGGT AGCAAC ATCTACCTGG ATATTAC ACT AG AG ATG A ATCC TG ACTTGTCTG ill i iGA 
AAGCCATGAAATCGCGGATCAGGTCGAGTCTATGCTGGAGGAGCGTTTTGGCGTCTTTGATACCGATGTCCATATC 
G AACC AG C ACCT ATCCCTG AGG ATG AAATTTT AG AC AATGTCT AT AAAAAATTG CTTATG CGTG AAC AATTG ATTG 
ACCAAGGAAACCAACTAGAAGAACTCITGACTGATGATTITGTCTATATTCGCCAAGATGGAGAGCAGATGGAT^ 

AAGAGGCTTATAAGACCAAAAAAGAGTTAAATTCTGCTATCAAGGACATTCAAATTACT^

CCAAACTCATCTGCTATGAGTTAGATGGTATCATCCATACCAGTATCTGGCGTCGCCACGAAACCTGGCAAAATAT 
CTTTCATCAAGAAACCAAAAAAGAATAG 

4162.1 

ATGACAATTAAACTAGTAGCAACGGATATGGACGGAACCnTCCTAGATGGGAATGGACGCnTGATATGGATCGT
CTCAAGTCTCTCTTGGTTTCCTACAAGGAAAAAGGGATTTACTTTGCGGTAGCTTCGGG 

TAGAAAAATTATTTGCTGGTGTTCGTGATGACATTATTTTCATCGCGGAAAATGGCAGTITGGTAGAGTATCAAGG 
TCAGGAOTGTATGAAGCGACTATGTCTCGTGACrm^ 

GTAGATATCAATAAACTGCrcnTGACGGGTAAGAAGGGTrCATATGTTCTAGATACGGTTGATGAG ACCTA TTTGA 
AAGTGAGTCAGCACTATAATGAAAATATCCAAAAAGTAGCGAGTTTGGAAGATATCACAGATGACATTTTCAAAT
TTACAACCAACTTCACAGAAGAAACGCTGGAAGATGGGGAGGCTTGGGTAAACGAAAACGTTCCTGGTGTTAAGG 
CCATG ACA ACTGG CTTTG AATCCATTGATATTGTTCTGG ACTATGTCG AT AAGGG AGTGGCCATTGTTG AATTAGT 
TAAAAAACTTGGTATCACAATGGATCAGGTCATGGCITTTGGAGACAATCITAATGACITACATATGATGCAGGTT 
GTGGGACATCCTGTAGCTCCTGAAAATGCACGACCTG AAATTTT AGAATTAGCAAAGACTGTGATTGGTCACCATA 
AGGAACGGTCGGTTATAGCTTATATGGAGGGCTTATAA

4162.2 

ATGGCAGATATAAAATTGATTGCATTGGACTTGGACGGGACCnTGCTGACTACTGATAAAAGGCTGACGGATCGT 
ACCAAGGAAACCTTGCAAGCTGCGCGTGATCGTGGTATCAAGGTCGTATTGACAACTGGTCGTCCCTTAAAAGCC 

ATGGATTTCITTCTCCATGAGTTAGGGACrGACGGTCAGGAAGATGAGTATACCATTA

TTCAGAAAAATACAGGAGAAATCCTTGATAAAACAGTCTTrrCATATGATGATGTGGCACGTTTGTATGAAGAAAC 
AGAGAAATTATCACTGCCTCTTGATGCCATCTCAGAAGGAACAGTTTATCAAATCCAATCGGACCAAGAAAGTCr 
TTATGCCAAATTCAATCCAGCTTTGACCTTTGTTCCAGTGGACTTTGAAGACrTATCT 
AAATGCGTGACTGCCTTTGCTCAAGAACCCXrGGATGCAGCCATTCAGAAGATTTCT 

ATG AAATCTTT AAATC ACGTG AAATGTTG CT AG AATGGTC A CC AAAG AATGTTC AT AAAG C AAC AG GTTTGG C AA
AACTAATCAGCCATCTTGGAATCGACCAAAGTCAAGTOATGGCTTGTGGTGACGAGGCCAATGACCTCTCTATGA
TTGAATGGGCAGGTCTTGGTGTTGCTATGCAAAACGCTGTTCCTGAAGTAAAGGCAGCCGCAAATGTAGTGACGC 
CGATGACCAACGATGAGGAAGCTGTCGCCTGGGCTATCGAAGAATATGTGCTAAAGGAGAACTAA 

4164.2

ATGGAAAGTTTACTTATTCTATTATTAATTGCCAATCTAGCTGGTCTCnTrCTGATTTG 

AGGAGAAACACTTAAGTAAGAGCTTGGAGGATCAGGCAGATCATTTGTCAGACCAGTTGGATTACCGCTTTGACC 
AAGCCAGACAAGCCAGCCAGTTAGACCAAAAAGATTTGGAAGTGGTTGTCAGCGACCGTTTGCAAGAAGTGCGGA 
TTGAATTGCACCAAGGTCTGACCCAAGTCCGTCAAGAAATGACAGATAATCTCCTCCAAACTAGAGACAAGACAG 
ACCAACGTCTCCAAGCCTTGCAGGAATCAAATGAGCAACGTTTGGAACAAATGCGCCAGACGGTCGAGGAAAAAC
TAGAAAAGACCTTGCAGACACGCTTACAGGCTTCCTTTGAGACAGTTTCT 
CCTTGGAGAAATGCAGACAGTTGCCCGTGATGTCGGAGCTCTTAACAA 

GATTCTGGGAGAATTGCAACTGGGGCAAATTATTGAAGACATCATGACACCTGCCCAGTACGAACGAGAATACGC 
AACGGTTGAAAACTCTAGTGAACGAGTGGAGTATGCCATCAAGTTACCCGGACAAGGCGACCAAGAATACGTCTA 
TCTGCCAATTGACTCTAAGTTTCCACTGGCAGATTATTACCGCTTGGAAGAAGCCTATGAGACAGGTGACAAGGAT
GAGATTGAACGCTGTCGTAAGTC^CTCCTAGCAAGCGTCAAGCGCnTrGCTAGGGATATTAGGAACAAGTACATA 
GCACCACCTCGGACGACCAATTTTGGAGTTTTGTTTGTTCCGACAGAAGG 

CGGTCTTCTTTG ATG ATTTG AG ACGGG AAGAACAG ATTATTGTTGCAGG ACCAAGTACCCTATCAGCCC ntiiAA 
CTCCCTATCAGTTGGTTTCAAGACCCTTAATATCCAAAAGAGTGCCGACCATATCAGCAAGACTCTTGCCAGTGTC 
AAGACCGAGTTTGGCAAGTTTGGTGGTATTCTGGTCAAGGCACAAAAACATCTCCAACATGCCTCnXjGCAATATTG
ATGAATTATTAAACCGTCGTACCATAGCTATCGAGCGGACGCTCCGTCACATTGAGTTGTCAGAAGGTGAGCCTGC 
GCTTGATCTACTCCATTTTCAAGAAAATGAGGAAGAATATGAAGATTAG 

4164.3 

ATGAAGATTAGTCACATGAAAAAAGATGAGTTATTTGAAGGCTTT^^

ACTCGAGCTGGGAAAAACTACCTAGCCTTTACCn*CCAAGATGATAGTGGCGAGATTGATGGGA^ 
GCCCAACCTCATAACATTGAGGCCnTrACCGCAGGTAAGGTTGTCCACATGAAAGGACGCCGAGAAGTTTATAAC 
AATACCCCTCAAGTCAATCAAATTACTCTCCGCCTGCCTCAAGCTGGTGAACCCAATGACCCAGCTGATTTCAAGG 
TCAAGTCACCAGTTGATGTCAAGGAAATTCGTGACTACATGTCGCAAATGATTTTCAAAATTGAAAATCCTGTCTG 

GCAACGGATTGTCCGAAATCTCTACACCAAGTATGATAAGGAATTCTACTCCTATCCAGCTGCCAAGACCAACCA
CCATGCCTTTGAAACGGGCTrGGCCTATCATACGGCGACCATGGTGCGTTTGGCAGACGCTATTA^ 
CCTCAGCTCAATAAGAGCCTGCTCTATGCGGGGATTATGTTGCATGACTTAGCTAAGGTCATCGAGTTGACGGGGC 
CAGACCAGACAGAGTACACAGTGCGAGGTAATCTTCTTGGACATATCGCTCTCATTGATAGCGAAATTACCAAGA 
CAGTTATGGAACTCGGCATCGATGATACCAAGGAAGAAGTCGTTTTGCTTCGTCATGTCATCCTCAGTCACCACGG 

CTTGCnTGAGTATGGAAGCCCAGTCCGTCCACGCATTATGGAAGCAGAGATTATCCATATGATTGACAATCTGGAT
GGAAGCATGATGATGATGTCAACAGCTCTTGCTTTGGTGGATA 
GATAATCGTTCCTTCTATAAACCAGATTTAGATTAA 

4166.2 

ATGAGTGAAAAAGCTAAAAAAGGGTTTAAGATGCCTTCATCTTACACCGTATTArTGATAATCAT^

CAGTGCTAACTTGGTTTATCCCTGCGGGGGCCTTTATAGAAGGTATTTACGAGACTCAGCCTCAAAATCCAC 

GATTTGGGATGTCCTCATGGCACCGATTCGGGCTATGCTAGGTACrCATCCAGAGGAAGGTTCGCTCATTAAAGAA 

ACGAGCGCAGCGATTGATGTAGCCnTCTTCATCCTTATGGTTGGTGGTTTC 

CTCTTGACGTAGGGATTGCCTCTATCGTGAAGAAGTATAAGGGCCGCGAAAAAATGTTAATTTTGGTACTGATGCC 
TTTGTTTGCCCTCGGTGGTACAACTTATGGTATGGGTGAAGAAACAATGGCCTTCTATCCACT
ATGATGGCCGTTGGTTrrGATAGCCTGACTGGTGTTGCAATTATTTTGCTCGGTTCTCAAA 

TACTCTGAATCCATTTGCGACAGGTATTGCTTCAGCGACTGCGGGAGTTGGTACAGGGGACGGTATCGTACTTCGT 
CTGATCTTCTGGGTTACCTTGACTGCTCITAGTACTTGGTTTC 
GACTAAGTCACTGGTTTATAGTACTCGCAAAGAAGATrTGAAACACTTr 
TCTACACTTAGCAGCAAACAAAAATCAGTTCTCTTCTTATTTGTGTTGACATTCA
TCCATGGACAGACCTTGGCGTTACCATTTTTGATGACTTTAATACTTGGTTGA 
TTGTCGGTrCATCTACTTCTGCACTAGGTACTrGGTACTTCCCAGAAGGCGCAATGCTCm 
CTGATTGGTGTTATTTATGGTCTTAAAGAAGATAAGATTATCTCTrCCTTCATGAATC 

TGTTGCCTTGATCGTAGCGATTGCTCGTGGTATTCAAGTTATCATGAACGACGGTATGATTACCGATACAATCCTC 
AACTGGGGTAAAGAAGGCTTGAGCGGTCTATCTTCACAAGTCTTTATCGTTGTAACTTATATCTTCT
GTCATTCITGATCCCATCTTCATCTGGTCTTGCCAGCGCAACTATGGGTATCATGGC 
TCCACITGGAGAATTTGTAAATGTCCGTCCTAGCrTGATTAT^ 

TGATTGCACCAACATCTGGTATTGTGATGGGAGCTCTTGCACITGGACGTATCAACATTGGTACT^ 
CATGGGCAAACTCGTAGTCGCTATTATTGTAGTGACCATCGCCCTTCTTCTCCTTGGAACOT 
AA

4166.3 

ATGAAAATAGATATAACAAATCAAGTTAAAGATGAATTTCT 
TACTCAATGAAGGAGAAAATGGAACACCTTTTGGACAAGCAATCC^ 
GTCGAGACATAGGTTTCACTACCTATCTTGACCCTAAAGGTTATTACGGATATGCAGAAATCGGTCAGGGAGCAG
AGCTrCTGGCCATTCTCTGTCATTTGGATGTTGTTCCATCAGGTGATGAAGCAGATTGGCAGACACCGCCATTTGA
AGCAACTATCAAAGACGGCTGGGTATTCGGACGTGGTGTCCAAGATGATAAAGGCCCTTCGCTCGCAGCTCTCTA 
TGCAGTAAAAAGCTTGCTGGACCAAGGTATTCAGTTCAAAAAGCGCGTACGCTTTATCTTTGGTA 
AACCCTCTGGCGCTGCATGGCACGCTACAATACCATCGAAGAACAGGCCAGTATGGGCTTTGCACCTGACTCATC 
TTTTCCTCTGACCTATGCTGAAAAAGGGCTTCTACAGGTCAAACTTCATGGCCCTGGATCGGATCA^

GAAGTAGGAGGCGCCTTTAACGTTGTACCAGACAAGGCCAACTACCAAGGTCTCCrCTATGAACAGGTTTGTAAC 
GGTCTCAAAGAAGCTGGTTATGATTACCAAACCACTGAACAAACCGTAACGGTTCTCGGAGTGCCAAAGCATGCT 
AAGGATGCTAGTCAAGGTATCAATGCTGTCATCCGACTAGCTACCATTCTTGCTCCTCTCCAAGAACACCCTGCTC 
TCAGTTTTCTTGCAACACAAGCAGGTCAAGACGGCACAGGAAGACAAATCnTTGGTGATATAGCAGATGAACCTT 
CTGGTCACCTATCCTITAATGTCGCAGGTCTCATGATCAATCATGAACGTTCTGAAATCCGTATTGACATTCGGAC
TCCTGTCrrAGCTGACAAGGAAGAACTAGTAGAGTTGCTTACAAGATGTGCACAAAACTACCAACTCCGCTACG^ 
AGAGTTTGACTATCTAGCGCCTCTATACGTCGCAGAAGACAGTAAACTCGTTAGCACACTGATGCAAATCTACCA 
AGAAAAGACTGGCGATAACAGTCCTGCTATTTCATCCGGTGGTGCG\CTTTTC 

GCCTTCGGCGCCTTATTCCCAGGAGCGAAGCAGACAGAACATC^GGCAAATGAATGTGCCGTTCTAGAAGATTTG 
T ACCGTGCTATGG ATATTTATGCCGAAGCCGTCTATCG ACTTGCAACTTAA

4169.1 

ATGTCrAATTCATTTGTCAAGTTGTTAGTCTCrCAATTATTTGCAAATTTAGCAGA 
CATTGCTAACATATACATTATITCAAAATCAGTAATTGCCACATCACTA 

TTGTTGCGAGTCTTTTAGTTCCGTTGGTTACTAAAA

AAGACTATATTATTGGCGATACTGGTAGGAATGTTTACCGTAATGCAATCCGTAGCGCCTTTGGTGACCrATCTAT 
TTGTTGTTGCAATTTCCATACTAGATGGTTTTGCAGCACCCGTTTCCTATGCTATTGTGCCACGCTATGCGACCGAT 
TTGGGTAAGGCTAATTCAGCCTTATCAATGACTGGTGAAGCTGTTCAATTGATAGGTTGGGGATTAGGTGGACT 
TGTTTGCAACAATTGGTCTGTTACCTACCACGTGTATCAATTTAGTCTTGTATATCATTTCTAGCTTTCT 

TTTCHTCCTAACGCTGAAGTGGAGGTGTTAGAGTCAGAA^

TTGCTAGAAATCCTAGATTAAGACTTTTTGTATCAGCAAATTTA 

TCCATTATACTTGin'iU'IGTAACGGAGTTATTAAATAAAACGGAAAGTTACTGGGGATATTCTAATACAGCATACT 
CTATTGGTATTATAATTAGTGGCTTAATTGCTTTTAGGCTATCTGA 

ATTATTCACCCCAAATCTAAAAACCATCCAGAATCCTTGCCTTAGCTTAGATCCTGGATGGTTTC^^ 

ATGGGTGTTTTTTACTAGACAAAAAAGAGTTTCCCCTTTATGGTATAAGTGTAGAAAAAAACAC

AAACTC ACATG AACAGTTTACC AAATC ATC ACTTCC AAAAC AAGTC I'l " 1"! ' 1 ' ACCAACTATCTTTCG ATGG AGGTCA 
TrrAACCCAGTATGGTGGTCTTATCTTTTTTCAGGAACTTTm 
ATTTAGTAACGAATGACCAACGCCGCTACTGTCGTTATTCGGATTCAGATATC 
TTAACAGGTTATGGAACGGACTATGCTTGTAAAGAATTGTCAGCTGATGCCTACTT 

TCCAAAATTGTTGGAAGGAGGGCAGCTTGCTTCACAGCCAACCTTATCCCG'rrri'Cl 1 1 CCAGAACTGACGAGGAA

ACAGTCCATAGTTTGCGATGCCTCAACCTTGAATTGGTCGAATTCTTTTTACAGTTTCACCAGCTAAACC 
TTGTAGATATCGATTCTACCCATTTCACAACTTATGGCAAGCAAGAAGGTGTTGCnTATAACG 
TCATGGCTATCATCCTCTTTATGCTTTCGAGGGGAAGACAGGTTATTGTT^ 
GTTATTGTTCTGAAGAGGCAGACAGCTTTATCACACCTGTTTTAGAACGGTTTAATCAACTTCT 

TAGTGGCTITGCGACCCCAAAATTATACGATTTAATTGAAAAAACAGGGCAATACTACCTCATAAAACTCAAG^
AAATACTGTTCTGAGCCGTCTTGGAGACCTTTCCCTCCCTTGCCCACAGGATGAGGACTTAACCATCTTGC 
TCCGCCTACTCAGAAACTCTCTATCAAGCAGGATCTTGGTCGCACAAGCGTCGTGTCTGCCAGTTCTCTGAACGAA 
AAGAAGGAAACTTGTTCTACGATGTTATTTCTCTCGTTACAAATATGACGAGTGGAACAAGCCAAGACCAGTTTCA 
GCTTTATCGTGGACGTGGTCAAGCCGAGAATTTCATCAAGGAGATGAAGGAGGGATTTTTTGG CGATAA AACGGA 

T AGTTCAAC CTTAATC AAAAACG AAGTTCGT ATG ATG ATG AG CTGT AT CG CCTAC AATCTCTATCTTTTTCTC AAA

CATCTAGCTGGAGGTGACTTCCAAACnTAACAATCAAACGCTC 

GTGTTCGAACAGGACGCAAGCAGCTCCTCAAATTGTCTAGTCTCTATGCCTATTCCGAATTGTTT^ 

TTCTAGGATTAGAAAAGTCAACCTGAATCTTCCTGTTCCTTATGAACCACCTAGAAGAAAAGCGTCGTTAATC 

CATTAA 

4169.3 

ATGATGGAGTTTTTTCAACAGCTTCCTCATTTAGAGCCATATGGCAATCCTCAGTATTTTG 

AACCTTGCCCATCTTTATAGGTCTCTTTTTCAAGAAACGCTTTGCCTGGTATGAAGTGT^ 

TTGTCACCATGTTGGTGGGTGGAAAGACCAATCAACTAGCTGCCrTGGGTATTTACCTTrGCT 

CCTGCTTTTCTACAAGCATTATCGAAAAAGCAAGGATGGCAAGTGGGTCTTCTACTTAGTTAGTTT^
CTTCCGATTATCTTTGTCAAGGTGCAACCAGCTATCAATGGAACGCAGTCTTTGCTTGGGTTCT^ 
CCTGACCTTTCGTTCGGTTGGAATTGTCATCGAGCTGAGAGATGGAGTGATTAAGGATTTTACCCTCTGGGAATTC 
CTCCGTTTCCTTCTCTTCATGCCAACTTTCTCGAGTGGTCCAATCGATCGCTTTAAGCGATTTAATGA 
GGCTATTCCTGAGCGAGATGAGTTGATGGATATGCTGGATGAATCTGTCCGCTATATCATGTGGGGCTTTTTGTAT 

AAGTTTATCCTAGCTCATGTTTTAGGAGAGACCTTACTACCTCCTCTGAAGAATTTAGC
TCTTTAATCTCTATGCCTTGGCAGTTATGTATACTTTTGGTCTGGAACTCTTCTTTGA 
TTGCTTTGGCCATCTCAAACTTGATGGGAATCCGTAGCCCTATCAACTTTAACAAGCCCTTTTTA 
AAAGGAGTTTTGGAATCGCTGGCATATGAGTCTGTCCTTCTGGTTCCGTGACTTTGTCTTTATGCGAATGGT 
GTGTTAACCAGAAAGAAAGTCTTTAAAAATCGTAATGTAACCTCAAGCATGGCCTACATTGTAAATATGCTGATTA 

TGGGATTTTGGCATGGTGTGACCTGGTACTATATCGCCTATGGACTCTTTCATGGACTAGGCTTGGTCATCAATGA
TGCCTGGGTTCGCAAGAAAAAAACGCTCA ATAA GGAACGGAAAAAAGCAGGGAAGGCT CCCCTA CCTGAGAATC

GCTGGATTCAGTTGCTTGGCATGGTTGTCACTTTCCA 

AT AATCT ATG GTTT AAAAAATAA 

4169.4 

ATGCTTAAACGCrTATGGATGATCTTCGGACCGGTCTTGATCGCTGGTTTGTTGGTTTT^ 

CCTACTGAGATGCATCATAATCTAGGAGCTGAAAAGCGTTCAGCAGTGGCTACTACTATCGATAGTTTTAAGGAGC 

GAAGTCAAAAAGTCAGAGCACTATCTGATCCAAATGTGCGTTTTGTTCCCTTCTTTGGCTCT^ 

TTTTGACGGTGCTCATCCTGCGGTATTAGCTGAGAAATACAATCGTTCCTACCGTCCTTATCT^ 

GGAGCTGCATCGCTTAACCAATATTTTGGAATGCAACAGATGTTACG\CAGCTGGAGAATAAACAAGTTGTGTAT 

GTTATCTCACCTCAGTGGTTCAGTAAAAATGGCTATGATCCAGCAGCCTTCCAGCAGTATTTTAATGGAGACCAGT 

TGACTAGTTTTCTGAAACATCAATCTGGGGATCAGGCTAGTCAATATGCAGCGACTCGCTTACT^ 

AAACGTAGCTATGAAGGACCTGGTTCAGAAGTTGGCAAGTAAAGAAGAATTGTCGACAGCAGACAATGAAATGAT 

TGAATTATTGGCTCGTTTTAATGAACGCCAAGCrrCCTTTTTTG 

ATAAGCATGTAGCTAAGTATTTAAAAATCnTGCCAGACCAGTTTTCTTATCAGGCAATAGAAGATGTTGTCA^ 
AGATGCTGAAAAAAATACrrCCAATAATGAGATGGGAATGGAAAATTATTTCTATAATGAGCAGATCAAGAAGGA 
TTTGAAGAAATTAAAGGATTCTCAGAAAAGCTTTACCTATCTCAAGTCGCCAGAGTATAATGAOT 
TTAACACAGTTrrCTAAATCTAAGGTAAACCCGATTTTTATCATTCCACCTGrTAAT 

CTGGTCTACGAGAGGATATGTACCAACAAACGGTGCAGAAGATTCGCTACCAGTTAGAAAGTCAAGGTTTTACCA 
ATATAGCAGATTTTTCTAAGGACGGCGGGGAGCCTTTCTTTATGAAGGACACCATTCACCTTG^ 
GTTGGCnTITGACAAGGCAGTTGATCCTTTCCTATCCAATCCCACACCAGCTCCGACTTACCATCTGAATGAGCGC 
TTTTTCAGCAAAGATTGGGCGACTTATGATGGAGATGTCAAAGAATTTCAATAG 

4169.6 

ATGGAGAAAAACCTCAAGGCTTTGAAACAAACAACAGACCAAGAAGGCCCAGCAATTGAACCTGAAAAGGCAGA 
GGATACCAAGACAGTCCAAAATGGTTACTTCGAGGATGCAGCTGTCAAGGACCGCACCTTGAGTGACTATGCAGG 
TAACTGGCAATCAGTTTATCCmCCTTGAAGACGGCACGTTTGACCAAGTCm 

GGTAAGATGACCCAGGCTGAGTACAAGGCTTACTATACAAAAGGCTATCATACAGATGTGACTAAGATTAACATT 

ACTGATAATACTATGGAATTTGTTCAAGGTGGACAAAGCAAGAAATACACITACAAGTATGTCGGTAAGAAAATT 

TTGACITACAAGAAAGGCAATCGTGGCGTGCGTTTCCrCTTTGAAGCCACAGATGCTGACGCrGGACAAT^ 

ATGTTCAGTTTAGTGACCACAATGTTGCCCCAGTTAAGGCAGAACATTTCCATATCTTCTTTGGA^ 

AGAAGCCCTCTTTGAAGAAATGGACAACTGGCCAACCTACTACCCAGATAA 

AGAAATGTTGGCGCATTGA 

4170.3 

ATGAAAGATGGTCATTTGCTAGCCCATCATATTCGTTTGTTGAATGGGCGGATTTT^ 

ATCCTGAGGCTCTTTATAGGGGTGAACAGGGCAAGATTTTAGCGGTTTTATGGAATAGTGAAACT^ 

CTG CG A C AG AT ATCGCGCTTGCG A CTGG ACTTGCG AAT AAT ACGCTG A CG ACT ATG AT AAAAAAGCT AG AGG A AC 

AAAAGCTTGTAATTGTTAGTCCGTGTGGAAAAGACAAGCGTAAGAAGTATTTAGTTTTA^ 

CCCAGAAAGAAGTGGGGCATCGTGTCAGTCAGAAATTGGATACTATCTTTTACAAAGGATTTTCAGAGGAAGAAA 
TT CAC C AATTTG AAG GTTTTC AAG AAAG A ATTTTG G CG AAT CTG AAAG AG AAG G G AAATG AG GTTT AG 

4170.4 

ATGACTAATTTAATTGGAACTTTTCAGGATCGTTTTAGTC 
GCTTTTGACCTTGTTACrAGCTATTTTGCrrGCGATTCCOT 

ACTGGGTCITGCAGATTGCAGGTATTTTCCAGACCATCCCGTCTCTGGCCITGTTGGGGCTCTTTATCC 
GGAATTGGGACCTTGCCGGCTrTGACAGCrCTAGTGATTTATGCGATTTTCCCrATTT^ 

GGCTGAAGGGAATTGATCCGAACCTGCAAGAGGCTGGGATTGCCTTTGGGATGACCAGATGGGAACGTCTCAAGA 

AATTTGAAATTCCACTCGCCATGCCTGTTATCATGTCTGGGATTCGGACGGCAGCTGTTTTGATTATCGGTACGG^ 

AACCITGGCGGCCnTGATTGGTGCAGGGGGACTAGGTTCCTTTATrCITlTGGGAATTGACCGTAATAATGCCAG 

TTGATTTTGATTGGGGCACTTTCTTCTGCAGTGCTAGCCATTC 

CAAAATTACGGACGATTTTCTCAGGTTTTGCCrrGGTGGCTTTAT^ 

GTTCAAAAAGAGAAGGAAAACTTGGTTATTGCrGGGAAAATAGGTCCAGAACCAGAAATTTTGGCCAATATGTA 

AAGTTGCTGATTGAAGAAAATACCAGCATGACrGCGACTGTTAAACCGAATTTrGGGAAGACAAGOT 

AAGCTCTGAAAAAAGGCGATATTGACATCTATCCTGAATTTACTGGTACGGTGACTGAAAGTTTGOT 

ACCCAAGGTGAGTCATGAACCAGAACAGGTTTATCAGGTGGCGCGTGATGGCATTGCTAAGCAGGATCATCTAGC 

CTATCrCAAACCCATGTCTTATCAAAACACCTATGCTGTAGCTGTTCCGAAAAAGATTGCTCAAGAATATG 

AAGACCATTTCAGACITGAAAAAAGTGGAAGGGCAGTTGAAGGCAGGTTTTACACTCGAGTTTAACGACCGTGAA 

GATGGAAATAAGGGCTTGCAATCAATGTATGGTCTCAATCTCAATGTAGCGACCATTGAGCCAGCCCTTCGCTATC 

AGGCTATTCAGTCAGGGGATATrCAAATCACGGATGCCTATTCGACTGATGCGGAATTGGAGCGTTATGATTTACA 

GGTCITGGAAGATGACAAGCAACrCTTCCCACCTTATCAAGGGGCTCCACTCATGAAAGAAGCTOT 

ACACCCAGAGTTGGAAAGAGTTCTTAATACATTGGCTGGTAAGATTACAGAAAGCCAGATGAGCCAGCTCAACTA 

C C AAGTCGGTGTTG AAGGC AAGTC AG C AAAG C AAGT AGCC AAGG AGTTTCTCC AAG AAC AAG GTTTGTTG AAG AA 

ATG A 
4170.5

ATGATGCATACTTATTTGCAAAAGAAAATTGAAAATATCAAAACAACCCTAGGTGAAATGTCAG GTGG TTACCGT 
CGTATGGTTGCGGCTATGGCTGATTTAGGATTTTCAGGAACTATGAAGGCrATCTGGGATGACCTCTTTGC 
GTAGTTTrGCCCAGTGGATTTATTTGCTGGTTTTAGGAAGTTTTCCTCTCTGGCT 
ATTGTTGACTGGATTGGGATGATTTGTAGCTTGACAGGGATTATCTGTGTAATCITTGTA TCGGAA GGTCGAGC^
GTAATTATCTTTTTGGCTTGATTAACTCTGTTATTTACCTTATTTTG 

CTGACGACACTTTACTrCACAGTCATGCAGCCAATTGGACTTCTAGTTTGGATTTATCAGGCACAG 
AAAAGCAGGAGTTTGTCGCGCGTAAACTGGACGGCAAGGGCTGGACAAAGTATCTTTCCATTAGTGTGCTTTGG
GGTTGGCCTTTGGCTTCATTTATCAGTCTATTGGTGCCAATCGTCCCTATCGTGATTCAATCACAGATGC AACCA A 
TGGGGTAGGGC^AATCCTCATGACAGCTGTTTACCGTGAACAGTGGATATTCTGGGCGGCTACCAATGTCTTTTCA
ATCTATCTCTGGTGGGGAGAAAGCCTGCAAATTCAAGGGAAATATCrAATTTATCTCATTAACAGTCrAGTTGGTT 
GGTATCAATGGAGCAAGGCAGCTAAGCAGAATACTGATTTACTTAACTAG 

4170.6 

ATGAG AAATATG AAGGCAAAATATGCTGTTTGGGTGG IT ITHT I'CTT AAATTTG ACTTATGCCATTGTTG AGTTTAT

TGCAGGTGGAGTATTTGGTTCTAGCGCTGTTCTTGCTGACTCTGTGCATC 

AT ATC AGCTTTTCTAG AAAC AATCTCC AATCGTG AAG AAG ACAATCAGT ACAC CTTGG G CT AT AAG CGGTTTAG CC 
TGCTAGGAGCCTTGGTAACAGCTGTGATTCTCGTAACGGGCTCTGTTCTAGTCATTTTGGAAAATGTCACGAAGAT 
TTTG CATCCGCAACCAGTCAATG ATGAG G G G ATTCTCTG GTT AG G AATTATTG CG ATT ACTATC AATCTGTTAGCG 
AGTCrGGTGGTTGGTAAGGGAAAGACAAAGAATGAGTCTATTCTGAGTCTGCATTTTCTGGA AGATA CGCTAGGG
TGGGTAGCTGTTATCCTGATGGCGATTGTTCITCGATrrACGGACTGGTATATCC TAGA TCCT^ 
CA' lTrClTl ' ClTl ATTCTTTCAAAAGCCCTTCCAre^ 

AAGGTCTTGATATCAAGCAAGTAAAGAGTGGCCTGGAGCGATTGGACAATGTGGCCAGCCTTAATCAGCTTAATC 
TCTGGACTATGGATGCITrGGAAAAAAATGCCATTGTCCATGTTTGTCrAAAAGAAATGGAACATATGGAAACTT 
TAAAGAGTCTATTCGAATTTTCCTAAAAGATTGTGGrrTTCAAAATATTACCATTG

ACTCACCAAACCCATAAGCGAAAGGTGTGTGACTTGGAACGGAGTTATGAGCATCAACATTAG 

4170.8 

ATGATTGAATACAAAAATGTAGCACTGCGCTACACAGAAAAGGATGTCTTGAGAGATGTCAACTTACAGAT TGAG 
GATGGGGAATTTATGGTTTTAGTAGGGCCTTCTGGGTCAGGTAAGACGACCATGCTCAAGATGATTAACCG TCn^
TGGAACCAACTGATGGAAATATTTATATGGATGGGAAGCGCATCAAAGACTATGATGAGCGTGAACTTCGTCTTT 
CTACTGGTTATGTTTTACAGGCTATTGCTCTTTTTCCAAATCTAACA 

ATGAAGGGGTGGAGCAAGGAAGAAATTACGAAGAAAACAGAAGAGCTTTTGGCTAAGGTTGGTTTACCAGTAGCC 
GAGTATGGGCATCGCTTACCTAGTGAATTATCTGGTGGAGAACAGCAACGGGTCGGTATTGTCCGAGCTATGATTG 
GTCAGCCCAAGATTTTCCTCATGGATGAACCCTTTTCGGCCTTGGATGCTATTTCGAGAAAACAGTTGCAGGT^

GACAAAAGAATTGCATAAAGAGTTTGGGATGAG\ACGATTTTTGTAACCCATGATACGGATG AAGC CTTGAAGTT 
GGCGGACCGTATTGCTGTCTTGCAGGATGGAGAAATTCGCCAGGTAGCGAATCCCGAGACAATTTTAAAAGCGCC 
TGCAACAGACTTTGTAGCAGACTTGTTTGGAGGTAGTGTTCATGACTAA 

4171.1

ATGTCAGCAGTTGCTATTTCAGCTATGACCAAGGTTATGCAAGAAACCCACGGAAATCCTTCTAGTATTCATGGTC 
ATGGTCGTCAAGCTGGCAAACTCTTGCGAGAAGCCCGTCAGGAACTAGCCCAGTTACTAAGGACAAAACCTCAAC 
ATATCTTTTTCACnTCTGGTGGGACTGAAGGCAATAATACTACCATCATTGGCTACTGTCTTCGTCACCAAG AA 
AGGAAAACATATCATCACAACTGCCATCGAGCACCATGCTGTCCTTGAAACAATTGATTACTTGG TrCA ACACTTT 

GGGTTTGAAGCAACCATTATCCAGCCAGAAAATCAAGAAATCACAGCCCAGCAAATTCAAAAGGCTTTACGTGAC
GATACGATTTTGGTTTCTACCATGTTTGTCAATAATGAGACAGGAAACCTACTGCCCATCGCTGAuAATTGGCCAAA 
TACTCAAGCAACACCCTGCTGCCTATCATGTTGATGCAGTTCAGGCn'ATTGGTAAAATC CCAAT TCATTCAGAAGA 
ATTGGGCATTGATTTTCTCACTGCTTCTGCCCACAAATTCCATGGTCCTAAGGGAATCGGTT^ 
GCATGGACTTTGATTCCTATCTACATGGCGGAGACCAGGAACAGAAAAAACGTGCAGGAACTGAAAATCTGCCTG 

CCATTGTAGGCATGGTTGCAGCCCTAAAAGAAGACCTAGAAAAACAAGAAGAACATTTTCAACATGTACAAAATC
TAGAAACTGCCTTTCTGGCAGAGCTGGAGGGCATTCAGTATTACCrGAATAGAGGAAAACATCATCTCCCTTATGT 
TCTCAATATTGGATTTCCTGGTCAGAAAAATGACCTCTTACTCCTTCGGCTAGATTTAGCTGGA^ 
CTGGCTCAGCCTGTACTGCAGGCGTTGTCCAATCCAGCCATGTTCTTGAAGCCATGTATGGCGCAAATTCAGAACG 
CTTGAAGGAATCCCTTCGCATCAGTTTGTCGCCACAAAATACCGTTGAAGACCTACAAACCCTCGCAAAAA^ 

AAAGAAATTATCGGAGGTTAG



4172.1

ATGTTATTCAAATTATCTAAGGAAAAAATAGAGCTAGGCTTATCTCGTTTATCGCCAG 
GTTTTGCCTTGGTCATTTTACTAGGCTCTCrrCTTTTGAGOT 

TATTTTGATCATCTTITCACTGCTGTCTCTGCAGTCTGTGTGACGGGTCrCTCAACCCTrCCAGTAGCT 
TAATATCTGGGGTCAAATAATCTGTTTGCTCTTGATTCAGATCGGTGGTCTAGGGCTCATGACCTTTATTG 
TCT AT ATCC AG AG C A AG C AA AAG CTT A GTCTTCGT AG CCGTG C AACT ATTC AGG AT AGTTTT AGTT ATGG AG AAAC
TCGATCTTTGAGAAAGTTTGTCTATTCTATTTTTCTCACGACCnTrTTGG
GTTTTCGCCTTATTCCTCAACTTGGCTGGGGACGTGGT 

AATGCCGGTTTTGATAATTTAGGGAGCACCAGTTTATTTGCTTTTCAGACCGATTTACT 

AGG CTTG ATTATTAC AGG CGGCCTTGGTTTTATGGTCTGGTTTG ATTTGGCTGGTC ATGTAGG AAG AAAG AAAAAA 
GGACGTCTGCACTTTCATACGAAGCTTGTACTATTATTGACTATAGGTTTGTTGTTATTTGGAA

CTTTCTTGAGTGGAACAATGCTGGAACGATTGGCAATCTCCCTGTTGCCGATAAGGTTTTAGTT AG 
ACAGTGACGATGCGAACAGCTGGCTTTTCTACGATAGATTATACTCAGGCTCATCCTGTGACTCT^ 
TCTTACAGATGTTTCTAGGTGGGGCACCTGGAGGAACAGCTGGGGGACTCAAGATTACGACA'l'l'rrrTGTCCTCTT 
GGTCTTTGCACGAAGTGAGCTTCTAGGCnTGCCTCATGCCAATGTTGCGAGACGAACGATCGCGCCGCGAACGGTT 

CAAAAATCCTTTAGTGTCTTTATTATCTTTTTGATGAGCI^

AGGCAATCCTCCCTTTATCCACCTCGTATTTGAAACCATrrCAGCTCTTAGTACAGTTG 
ACTCCTGACCTTGGGAAATTGGCTCTCAGTGTTATCATGCCACTTATGTTTATGGG 

ACGAATTGGTCCCTTGACCTTGTTTGTTAGCTTGGCAGATTACCATCCAGAAAAGAAAGATATGATTCACTATA 
AAAGCAGATATTAGTATTGGTTAA 

4172.2 

ATGTCAGATCGTACGATTGGAATTHGGGCTTGGGAATTTTO 

ATATGAATATTATCGCTATTGATGACCACGCAGAGCGCATCAATCAGTTTGAGCCAGTTTTGGCGCGTGGAGTGAT 
TGGTGACATCACAGATGAAGAATTATTGAGATCAGCAGGGATTGATACCTGCGATACCGTTGTAGTCGCGACAGG 

TGAAAATCTGGAGTCGAGTGTGCTTGCGGTTATGCACTGTAAGAGTTTGGGGGTACCGACTGTTATTGCTAAGGTC
AAAAGTCAGACCGCTAAGAAAGTGCTAGAAAAGATTGGAGCTGACTCGGTTATCTCGCCAGAGTATGAAATGGGG 
CAGTCTCTAGCACAGACCATTCTTTTCCATAATAGTGTTGATGTCITrCAGTTGGATAAAAATGTGTCTA 
AGATGAAAATTCCTCAGTCTTGGGCAGGTCAAAGTCTGAGTAAATTAGACCTCCGTGGCAAATACAATCTGAATA 
TTTTGGGTTTCCGAGAGCAGGAAAATTCCCCATTGGATGTTGAATTTGGACCAGATGACCTCTTGAAAGCAGATAC 

CTATATTTTGGCAGTCATCAACAACCAGTATTTGGATACCCTAGTAGCATTGAATTCGTAA

4172.3 

ATGAAGTTATTGTCTATCGCAATTTCTAGCTATAATGCAGCAGCCTATCTTCATTACTGTGTGGAGTCGCTAGTGA 
TTGGTGGTGAGCAAGTTGGGATTTTGATTATCAATGACGGGTCTCAGGATCAGACTCAGGAAATCGCrrGAGTGTTT 

AGCTAGCAAGTATCCTAATATCGTTAGAGCCATCTATCAGGAAAATAAATGCCATGGCGGTGCGGTCAATCGTGG
CTTGGTAGAGGOTCTGGGCGCTATTTTAAAGTAGTTGACAGTGATGACTGGGTGGATCCTC 
ATTCITGAAACCTTGCAGGAACITGAGAGCAAAGGTCAAGAGGTGGATGTCTTTGTGACC AATTT TGT 
AGGAAGGGCAGTCTCGTAAGAAGAGTATGAGTTACGATTCAGTCTTGCCTGTTCGGCAGATTTTTGGCTGGGACCA 
GGTCGGAAATTTCTCCAAAGGCCAGTATACCATGATGCACTCGCTGATTTATCGGACAGATTTGTTGCGTGCrAGC 

CAGTTCTAA

4172.4 

ATGAAATTCAATCCAuAATCAAAGATATACTCGTTGGTCTATTCGCCGTCTCAGTGTCGGTGTTGCCTCAGTTGTTG 
TGGCTAGTGGCTTCTTTGTCCTAGTTGGTCAGCCAAGTTCn'GTACGTGCCGATGGGCrCAATCGAACCCCAGGTC^ 
AGTCTTACCTGAAGAGACATCGGGAACGAAAGAGGGTGACTTATCAGAAAAACCAGGAGACACCGTTCTCACTCA
AGCGAAACCTGAGGGCGTTACTGGAAATACGAATTCACTTCCGACACCTACAGAAAGAACTGAAGTGAGCGAGGA 
AACAAGCCCTTCTAGTCTGGATACACTTTTTGAAAAAGATGAAGAAGCTCAAAAAAATCCAGAGCT 
CTTAAAAGAAACTGTAGATACAGCTGATGTGGATGGGACACAAGCAAGTCCAGCAGAAACTACTCCTGAACAAGT 
AAAAGGTGGAGTGAAAGAAAATACAAAAGACAGCATCGATGTTCCTGCTGCTTATCTTGAAAAAGCTGAAGGGAA 

AGGTCCTTTCACTGCCGGTGTAAACCAAGTAATTCCTTATGAA

TTACTAAAAGCTTCGGATAATGCTCCTTGGTCTGACAATGGTACTGCTAAAAATCCTGCTT^ 

GATTAACAAAAGGGAAATACTTCTATGAAGTAGACTTAAATGGCAATACTGTTGGTAAACAAGGTCAAG 

TTGATCAACTTCGCGCTAATGGTACTCAAACTTATAAAGCTACTGTTAAAGTTTACGGAA 

TGACTTGACTAATCTAGTTGCTACTAAAAATGTAGACATCAACATCAATGGATTAGTTGCTAAAGAAACAGTTCAA 
AAAGCCGTTGCAGACAACGTTAAAGACAGTATCGATGTTCCAGCAGCCTACCTAGAAAAAGCCAAGGGTGAAGGT
CCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACTCTTCGCAGGTGATGGCATGTTGACTCGTCTCTTGC 
TCAAGGCATCTGACAAGGCACCATGGTCAGATAACGGCGACGCTAAAAACCCAGCCCTATCTCCACTAGGCGAAA 
ACGTGAAGACCAAAGGTCAATACTTCTATCAAGTAGCCTTGGACGGAAATGTAGCTGGCAAAGAAAAACAAGCGC 
TCATTGACCAGTTCCGAGCAAATGGTACTCAAACTTACAGCGCTACAGTCAATGTCTATGGTAACAAAGACGGTA 
AACCAGACTTGGACAACATCGTAGCAACTAAAAAAGTCACT ATT AACATAAACGGTTTAATTTCT AAAG AAACAG

TTCAAAAAGCCGTTGCAGACAACGTTAAAGACAGTATCGATGTTCCAGCAGCCTACCTAGAAAAAGCCAAGGGTG 
AAGGTCCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACTCTTCGCAGGTGATGGTATGTTGACTCGTCT 
CTTGCTCAAGGCATCTGACAAGGCACCATGGTCAGATAACGGTGACGCTAAAAACCCAGCCCTATCTCCACTAGG 
TG AAAACGTG AAG ACC AAAGGTC A AT ACTTCT ATC A ATT AG CCTTGG ACGG A A ATGT AG CTG G C A AAG AAAAAC A 
AGCGCTCATTGACCAGTTCCGAGCAAACGGTACTCAAACTTACAGCGCTACAGTCAATGTCTATGGTAACAAAGA
CGGTAAACCAGACTTGGACAACATCGTAGCAACTAAAAAAGTCACTATTAACATAAACGGTTTAATTTCT 
AACAGTTCAAAAAGCCGTTGCAGACAACGTTAAGACAGTATCGATGTTCCAGCAGCCTACCTAG 

4172.5 
ATGAAACTAAAAAGTTATATTTTGGTTGGATATArrATTTCAACCCTCTT
TCAAAAAATGCTGATTGCGAAAGGCGAGATTTACTTTTO 

GGGATTAGTCTCTTTCTCCTATTGCCAGTCTTTACGTCGTTGGGCAAACTCAAGGAGCATGCCAAGCGGGTAGCGG 
CCAAGGATTTTCCTTCAAATTTGGAGGTTCAAGGTCCTGTAGAATTTCAGCAATTAGGGCAA^ 
GTCCCATGATTTGCAGGTAAGCTTTGATTCaTGGAAGAAAGCGAACGAGAAAAGGGCTTGATGATT^

GTCGCATGATATTAAGACTCCTATCACTTCGATCCAAGCGACGGTAGAAGGGATTTTGGATGGGATTATCAAGGA 
GTCGGAGCAAGCTCATTATCTAGCAACCATTGGACGCCAGACGGAGAGGCTCAAT AAACTG GTTGAGGAGTTGAA 
TTTTTTGACCCTAAACACAGCTAGAAATCAGGTGGAAACTACCAGTAAAGACAGTATTTT^ 
ATTGAGTGCATGAGTGAATTTCAGTTTTTGATTGAGCAGGAGAGAAGAGATGTCCACTTGCAGGTAATCCCAGAGT 
CTGCCCGGATTGAGGGAGATTATGCTAAGCTTTCTCGTATCnX3GTGAAT

TGCTCCAGGAACCAAGCTGGAAGTGGTGGCTAAGCTGGAGAAGGACCAGCTTTCAATCAGTGTGACCGATGAAGG 
GCAGGGTATTGCCCCAGAGGATTTGGAAAATATTTTCAAACGCCTTTATCGTGTCGAAACTTCGCGTAACATGAAG 
ACAGGTGGTCATGGATTAGGACTTGCGATTGCGCGTGAATTGGCCCATCAATTGGGTGGGGAAATCACAGTCAGC 
AGCCAGTACGGTCTAGGAAGTACCTTTACCCTCGTTCTCAACCTCTCTGGTAGTGAAAATAAAGCCTAA 

4172.6

ATGTTTGGTCAAACGGCTCAACATGGTCTTACGAATAGCCTGAAAGACT 

CACAATTGGCGTTTTTTTGCCAGATGCTCCGCTGTTCCAGATCGGTTGAGCAGGGTACTGGAAATCACCGTCOT 
GTTCAATATGATTCAGCAGATATTCTCGCATTTTGGGATGACTCACTTGGGACAAATCAAGTTGGTCTATCAAGAG 
TCGATTGACCTTGAGTTGCTGGTCAATGCACTTAATCATCACTTGCTCATTGACAGACTGGTCCTCACGCCCAATC
AAATAACGATAGAAATCGACAGGCAGATAGTACATGGTCTTGACCTGCTGAAGGGGCGTAAAGACAAAGAGATTA 
TCGACATAAAAAGTATGTTCAGGCAGTTAGAACTGGCTAGCACGCAACAAATCTGTCCGATAAATCAGCGAGTGC 
ATCATGGTATACTGGCCTTTGGAGAAATTTCCGACCTGGTCCCAGCCAAAAATCTGCCGAACAGGCAAGACTGA 

4174.1

ATGGAACATITAGCAACTTATTTTTCAACCTATGGAGGAGCTTTCTrCGCTG 

GATTAAGCGGTATGGGGTCTGCTTATGGAGTTGGTAAGGCTGGGCAATCTGCCGCAGCTT TACTG AAAGAACAGC 
CTGAAAAGTTTGCCTCAGCTTTGATATTGCAATTATT^ 

TTTAATTTGGTTGCAATTAACTCCAGAACTTCCTTTAGAAAAAGGCGTTGCTT 
CTATTGTAGGATACTTTTCAGCTAAGCATCAAGGAAATGTAGCAGTAGCGGGAATGCAAA TCTTG GCTAAAAGAC
CAAAAGAATTCATGAAGGGAGCAATTTTAGCTGCCATGGTAGAAACCTATGCAATTCTTGCTTTTGTCGTATC 
CATTTTGACCCTTCGTGTATAA 
4175.2 

ATGTTAAAATCAGAAAAACAATCACGTTATCAAATGTTAAATGAAGAATTGTCCTTCCTATTGGAAGGCG AAACC 
AATGTTTTGGCTAATCTTTCCAACGCCAGTGCTCTCATAAAATCACGTTTTCCr AATACCGTATTTGCAGGC 1 1 1 1 A

TTTGTTCGATGGAAAGGAATTGGTTTTAGGCCCCTTCCAAGGAGGTGTTTCCTGCATCCGTATTGC 
GGTGTTTGTGGTGAGGCAGCTCACTITCAGGAAACTGTTATTGTTGGAGATGTGACGACCTATCT 
CTTGTGATAGTCTAGCTAAAAGTGAAATTGTGGTGCCGATGATGAAGAATGGTCAGTTACTTGG AGTTC TGGATCT 
GGATTCTTCAGAGATTGAGGATTACGATGCTATGGATCGAGATTATTT^ 
AAGACAGCATGGGACTTTACGATGTTTGAGGAAAAATCTTAA

4175.3 

ATGTCAGTATTAGAGATCAAAGATCTTCACGTTGAGATTGAAGGAAAAGAAATTTTAAAAGGGGTTAACCTGACC 
CTGAAAACAGGAGAAATTGCCGCTATCATGGGACCAAATGGTACAGGTAAATCGACTCTTTCTGCCGCTATCATG 

GGAAATCCAAACTATGAAGTAACTAAAGGTGAAGTTTTGTTTGATGGCGTAAACATCCTTGAGTTGGAAG

GAGCGTGCGCGTATGGGACTTTTCCTTGCTATGCAATACCCATCAGAAATCCCTGGAATTACCAATGCTGAGTTTC 
TTCGTGCCGCTATGAATGCGGGTAAAGAAGATGATGAGAAGATTTCAGTTCGTGAGTTTATTACTAAGCTAGATGA 
AAAAATGGAATTGCTCAACATGAAAGAAGAAATGGCAGAGCGTTACCTCAACGAAGGCTTCTCTGGTGGTGAGAA 
AAAACGCAATGAAATTCTTCAACTTTTGATGTTGGAGCCAACATT^ 

GATATTGACGCTCTrAAAGTTGTGTCTAAAGGTGTCAATGCCATGCGTGGTGAAGGTTTTGGTGCTATGAT CATC A
CTCACTACCAACGT C I ' l ' I ' l G AACTATATCACACCTG ATGTGGTACACGTGATG ATGG AAGGTCGTGTTGTCCTTTC 
TGGTGGTCCAGAATTGGCTGCGCGTTTGGAACGTGAAGGATACGCAAAATTAGCTGAAGAACTTGGCTACGACTA 
CAAGGAAGAATTGTAA 

4174.4

ATGCCCTACAAAAGACAAAGGAGTTTTTCAATGGCACTTTCTAAACTA 

ACCATTCGAAAAATCCACATCACCAAGGGAAGTTAGAAGATGCTGAGCAAATCAGTCTCAACAATCCGACTTGTG 
GGGATGTCATCAACCTCTCTGTCAAGTTTGATGCAGAGGACCGTTTGGAAGATATTGCTTTTCTA AATTC AGGAT^ 
C ACG ATTTCAACTG CTTCTG CT AGT ATG ATG AC AG ATG CCGTTTT AGG A AAAACCAAAC AAG AAATTTT AG A ACTG 
GCGACTATTTTTTCTGAAATGGTTCAAGGGCAAAAAGATGAGCGTCAAGACCAACTTGGAGACGCGGCATTCTTG
TCAGGTGTTGCCAAATTCCCTCAAAGAATCAAGTGTGCAACCCT^ 
AATCAAGAAAAACAGTAA 

4175.5
ATGAAAATTCAAGACCTATTGAGAAAAGATGTCATGTTGCTAGATTTGCAGGCAACTGAAAAAACAG CTGT CATC

GACGAGATGATTAAAAATTTGACAGACCACGGTTATGTAACAGATTTTGAAACATTTAAAGAAGGAAT^ 

CGTGAAGCTTTGACITCTACrGGTTTGGGTGATGGAATCGCAATGCCT 

GCGACAGTTCTATTTGCTAAGTCAAATAAGGGTGTTGACTACGAGAGCTTGGATGGACAAGCAACTGACCT 
TCATGATTGCAGCTCCAGAAGGTGCCAATGATACTCACTTGGCAGCCTTGGCAGAATO
AGACGGTTTTGCAGACAAACTTCGTCAAGCAACATCTGCAGACCA^ 
AAAACTGAGGAACTTGTTCAAGCACCTGCTAATGACrCrGGTGACTTTATCGTAGCTGTT^ 
GTATTGCCCACACTTACATGGCCCAAGAAGCCCTTCAAAAAGTAGCTGCTGAAATGGGGGTTGGTATCAAGGTCG 
AAACCAACGGTGCTAGCGGTGTTGGAAATCAACTAACTGCAGAAGATATCCGTAAGGCTAAAGCTATTATCATTG 

CAGCAGACAAGGCCGTTGAAATGGATCGATTTGATGGAAAACCATTGATCAATCGTCCAGTTGCTGACGGTATCC
GTAAGACAGAAGAGCTAATTAACTTGGCTCTTTCAGGAGATACTGAAGTCTACCGTGCCGCT^ 
CTGCAACAGCCTCTAACGAAAAACAAAGCCTTGGTGGTGCCTTGTACAAACACTTGAT GAGTG GTGTATCTCA 
TGTTACCATTCGTTATCGGTGGTGGTATCATGATTGCCCTTGCCTT^ 
GAAAACCTTGGCAATCTTGGTTCTTACCATGAGTTAGCTTCTATGTTCATGAAAA 

GATGCTTCCAGTCTTTGCGGGTTATGTTGCCTACTCTATTGCTG

GGTGCTATTGCCAAAGAAGGTTTTGCCTTTGGTAAAATTCCTTATGCCGCAGGTGGTGA^ 

CAGGTGTCTCATCTGGTTTCCTAGGTGCCCTTGTTGGTGGATTTATCGCAGGTGCCTTGGTTCTTGCCATCAAGAAA 
TACGTTAAAGTTCCTCGTTCACTCGAAGGTGCTAAATCAATCCTTCT 

GATTTGTTATGCTAGCTGTGAATATCCCAATGGCTGCAATCAACACTGCTATGAATGACTTCCTAGGCGGTCTTGG 
AGGAGGTTCAGCTGTCCnTCTTGGTATCGTCCTTGGTGGAATGATGGCTGTTGACATGGGTGGACCAGTTAATAAA
GCAGCTTATGTCTTTGGTACAGGTACGCTTGCAGCAACrGTTTCTTCAGGTGGTTCTGTAGCCATC 
TGGCTGGAGGAATGGTGCCACCACTTGCAATCTTTGTCGCAACTCT^ 
ACGTAACTCTGGTTTGACAAACATCATCATGGGCTTGTCATTTATCACTC 

GACCCAGCTCGTGCGATTCCAAGCTTCATCCTTGGTTCAGCAGTAGCAGGTGGACTCGTTGGTCTTACTGGTATCA 
AACTCATGGCGCCACACGGAGGAATCTTCGTTATCGCCCTTACT^^

GTAGGAGCAATCGTAAGTGGTGTGGTTTATGGTTACCTACGCAAACCACAAGCATAA 

4175.6 

ATGGCAAACAAGAATACAAGTACAACAAGACGGAGACCGTCTAAAGCAGAACTGGAAAGAAAAGAAGCGATTCA 
ACGAATGTTGATTTCGTTAGGAATTGCGATTTTATTGATTTTCGCAGCCTTCAAATTAGGGGCTGCA^
CTTTATAATTTAATTCGCTTGCTAG 

TTTCAAGTGGATACGAAAACAGGAAGGACTCrrATCTGGCTTTTTCACCATATT^ 

AGGCCTACTTGGTTTGGAAATATGGTTTGGACAAGTCCGTTCTAAAAGGGACCATGGCTCAGGTTGTG ACAGA TCT 
GACTGGTTTTCGAACGACTAGCTTTGCTGGAGGGGCCTTG^ 

TTTTCAAATATCGGAACTTACTTTATTGGTTCTATCTTGATTTTAGTGGGTTCT

TACGATATTGCTGAATTTTTCAGTAGAGGCTTTGCCAAATGGTGGGAAGGGCACGAGCGTCGAAAAGAGGAACGC 
TTTGTCAAACAAGAAGAAAAAGCTCGCCAAAAGGCTGAGAAAGAGGCTAGATTAGAACAAGAAGAGACTGAAAA 
AGCCTTACTCGATTTGCCTCCTGTTGATATGGAAACGGGTGAAATTCTGACAGAGGAAGCTGTTCAAA^ 
CCTATTCCAGAAGAAAAGTGGGTGGAACCAGAAATCATCCTGCCTCAAGCTGAACTTAAATTCCCTGAACAGGAA 

GATGACTCAGATGACGAAGATGTTCAGGTCGATTTTTCAGCCAAAGAAGCCCTTGAATACAAACTTCCAAGCTTA
CAACTCTTTGCACCAGATAAACCAAAAGATCAGTCTAAAGAGAAGAAAATTGTCAGAGAAAATATCAAAATOT 
GAAGCAACCTTTGCTAGCTTTGGTATTAAGGTAACAGTTGAACGGGCCGAAATTGGGCCATCAGTGACCAAGTAT 
GAAGTCAAGCCGGCTGTTGGTGTAAGGGTCAACCGCATTTCCAATCrATCAGATGACCTCGCrCTAGCCTTGGCTG 
CCAAAGATGTCCGGATTGAAGCACCAATCCCTGGGAAATCCCTAATCGGAATTGAAGTGCCCAACT CCGA TATTG 

CCACTGTATCTTTCCGAGAACTATGGGAACAATCGCAAACGAAAGC

AGGCTGrrAATGGAACCGCAAGAGCrTTTGACCTTrCTAAAATGCCCCACTrGCTAG 

AGGGAAGTCAGTAGCAGTTAACGGCATTATTGCTAGCATTCTCATGAAGGCGAGACCAGATCAAGTTAAATTTAT 
GATGGTCGATCCCAAGATGGTTGAGTTATCTGTTTACAATGATATTCCCCACCTCTTGATTCC AGTCG TGACCAAT 
CCACGCAAAGCCAGCAAGGCTCTGCAAAAGGTTGTGGATGAAATGGAAAACCGTTATGAACTCTTTGCCAAGGTG 

GGAGTTCGGAATATTGCAGGTTTTAATGCCAAGGTAGAAGAGTTCAATTCCCAGTCTGAGTACAAGCAAATTCCG
CTACCATTCATTGTCGTGATTGTGGATGAGTTGGCTGACCTCATGATGGTGGCCAGCAAGGAAGTGGAAGATGCTA 
TCATCCGTCTTGGGCAGAAGGCGCGTGCTGCAGGTATCCACATGATTCTTGCAACTCAGCGTCCATCTGTTGATGT 
CATCTCTGGTTTGATTAAGGCCAATGTTCCATCTCGTGTAGCATTTGCGGTTTCATCAGGAACAGACT 
ATTTTGGATGAAAATGGAGCAGAAAAACTTCTTGGTCGAGGAGACATGCT^ 

55 CCAGTTCGTCTCCAAGGCTCCTTTATCTCGGATGACGATGTTGAGCGCATTGTGAA CTTCA TCAAGACT 

ATGCAGACTACGATGAGAGTTTTGATCCAGGTGAGGTTTCTGAAAATGAAGGAGAATTTTCGGATGGAGATGCTG 
GTGGTGATCCGCTTTTTGAAGAAGCTAAGTCTTTGGTTATCGAAACACAGAAAGCCAGTGCGTCTATGATTCAGCG 
TCGTTTATCAGTTGGATTTAACCGTGCGACCCGTCTCATGGAAGAACTGGAGATAGCAGGTGTCATCGGTCCAGCT 
G AAG GT ACC AAA CCTCG AAAAGTGTT ACAAC AAT AA 


4176.1 

ATGAGTTATTTTAAAAAATATAAATTCGATAAATCCCAGTTCAAACTTGGTATGCGAACCT^ 
CTGTTTTTCTAGTTCTCTTGATTTTTGGCTTTTTTGGCTGGAA^ 

GCCTGAGGGAGAGTTTTGATGAGAGTGTTCATTTTGGGACTTCGCGTATTCTAGGAAATAGTATCGGTGGACTCTA 
65 TGCCTTGGTCTTCTTCTTATTAAATACCTTriTCCACGAAGCCTmGG 
CCATGTTAACCATTATGACAAATCTAGCCATGAATAACAAAGCAGGGGTTATTGGTGGTGTAGCAGCTATGTTAAT

CATTACCCTATCAATTCCAAGTGGTGAGACAATTTTC 

TTTGTCGCAATTATCGTAAATTACGATATTGATCGTATTCGT 

4178.2

ATG AATAAATCAGAACACCGCC ACCAACTTATACG CGCTCTTATCACAAAAAACAAG ATTCATACACAGG CTG AG 
TTG C AAG CCCTTCTTG CTG AG AACG AC ATTC AAGT A ACCC AGG C A A C CCTCTC A CG CG AC ATC AAAAAT ATG AAC 
CTATCAAAAGTCCGCGAAGAAGATAGCGCTTATTATGTTCnTAACAATGGTTCCATCrCAAAATGGGAAA 
TCGAACTCTACA TGGA AGACGCCCTTGTCTGGATGCGCCCAGTTCAACACCAAGTCCTACTAAAAACCCTTCCTGG 
10 ACTGGCTCAATCCITTGGTTCrATCATTGATACTTTC 

TCT GTCT TATCATCrGTGAAGATGCAGATACTGCTCAAAAGTGCITrGAAGAACrGAAAAA^ 
TTTCTTTGAAGAATAA 

4179.1 

ATGAAAAGTATAAAATTAAATGCTCTATCITACATGGGAATTCGTGT
AACCTATOTCGCGCGTGTCTO 

TGCCCTTTGCAACnTATGGTGTCTATAACTACGGTTTAAGGGCTATCAGTAATGTCAAGGATAACAAAAAAGATCT 
TAACAGAACCTTTrCTAGTCTTTTITAOT 

ATCCTCTCl'l'CIUU'ACrGATAATCCAATCGTCAAAAAGGTCTACCTTGTTATGGGGATTCAACTCATTGCCCAGATT 
20 TTTTCAATCGAATG GGTC AATGAAGCTCTGGAAAATTACAGTTTTCT ClU ' ll 

G ATGCTGGTCTCT ATTTTCTT ATTTGTTAAAAATG AAC ACG AT ATTGTTGTCT AT AC ACTTGTG ATG AGTTT ATCG A 

CGCTGATTAACTACCTGATTAGTTATTTITGGATTAAAAGAGACATCAAACTTGTrAA 

TAAACCACTaTTCTCCCTCTGACAGCCATGTTAGTCmGCCAATGCCAATATGCTOT 

TCITCCTCGTTAAAACAGGGATTGATGTCAACGTTAGTTACTATACCATAGCTCAGCGAATTGTGACCGTTATA^ 
25 TGGGGTTGTAACAGGTGCAATTGGAGTGAGTGTGCCTCGTCTCAGTTACTATCTGGGGAAAGGAGACAAAGAAGC 
CTATGTrrCTCTGGTTAATAGAGGTAGTCGAATCrTTAACITCTT^ 

TAGGACCAAATGCCATCCTACITrACGGTAGTG AAAAAT AT ATCGGAGGCGGCATCTTGACCTCTCTCI^ 
TCGTACGATTATCC TGGC CTTAGATACCATTCTTGGTC 

ACAGTCTATACAGTCTTTGCTGGGCTACTCAATTTGGGCTTGAATAGTCTCC^^ 
30 TGAATACTACTTACTGACAACT ATGCT ATCAGAGACTTCTCTACTTGTTTTCTAT^ 
AACTCATCCACTTGGGACATATCTTTAGCTATACTGTTC^ 

TCCTGATTAATTTCGTGTATCCTGTAGATATGGTCATTAATTTGCCATTTTTGATTA 
ATACTGGTTTGATTGTCTTGCTATCAGCTATCTCT^ 
^ ^ GAATTTTTAAACCATGTCCTAGCCTTAAAAAATAAATT^ 

4179.2 

ATGAAACAACTAACCGTTGAAGATGCCAAACAAATTGAATTAGAAATTTTGGATTATATTGATACT 
AGCACAATATCAACTATATTATTAACTACGGTACTCTGATTGGGGCGGTTCGACATGAGGGCTTTATCCCTTGGGA 
CGACGATATTGATCTGTCCATGCCTAGAGAAGACTACCAACGATTTATTAACATTTTTCAAAAGGAAAAAAGCAA 
40 GTATAAGCTCCTATCCTTAGAAACTGATAAGAACTACTTTAACAAC^ 

ATTATTGATACTCGAAATACAAAAACCTATGAGTCTGGTATCnTATCGATATTTTCCCTATAGATCGCT 

TCCTAAGCTCAT TGATA CTTGTTATAAACTGGAAAGCTTCAAACT 

TATAAGGATAGCOTITAAAAGATTGGATACGAACAGCCTrCTG^ 

TG C AAAT AAAATCG AG AAA G AAATTC AA AAAT AT AGTCGTG AAA ATGGGC AAT AT ATG G CTTTT ATCCCTTC AAA 
45 ATTTAAGGAAAAGGAAGTCTTCCCAAGTGGTACCTTT 

CCTGCACCTGAAAA-ATTTGATACrATTTrGACACAATTTTATGGAGATTATATGACCCT 
GCTTCTACAGTCATGAATTTCACGCTTATAAATTGGAGGATTAG 



50 ATGATAAAAATCAATCATCTAACCATCACACAAAACAAAGATTTACGAGATCTTGTATCTGACCTAACCATGACC 
ATCCAAGACOTGGAAAAGGTTGCTATTATTGGTGAAGAAGGAAATGGCAAATCAACCTTACIT 
GGGGAAGCITTGTCTGATTTCACTATCA AGGG AAACATCCAATCTGACTATCAGTCACTGGCCTACATTCCTCAAA 
AAGTCCCTGAGGACCTAAAAAAGAAA ACTT TACACGACTAOTCnTn'AGA 
CCTCTATCGTT TGGC GGAGOAATTGCATTTTGATAGCAATCGTTrCGCAAG 

55 GGGGGCGAAGCTTTGAAAATTCAGCITATCCATGAGTTAGCCAAACCCTITGAGAT^ 

CAAATGACCTAGACCn TGAG ACAGTTGATTGGCTAAAAGGCCAGATTCAAAAGACCAGGCAAACCGTTATTTTCA 
TITCCCATGATGAAGACnTTCTTTCrGAAACGGCAGACACTATTGTTCACITG 

AGCGGAAACGCTAGTAGAGCATTTAGACTATGATAGCTATAGTGAGCAGAGAAAGGCTAATTTTGCCAAACAAAG 
TCAGCAAGCTGCTAACAACCAAAGAGCCTACGATAAAACCATGGAAAAACATCGGAGAGTTAAGCAAAATGTAG 
AAACTGCGCrrCGAGCTACCAAAGATAGTACTGCCGGTCGCCTATTGGCTAAAAAGATGAAAACTGTCCTCTCA^
AAGAAAAACGCTACGAAAAGGCAGCTCAGTCCATGACTCAAAAGCCACTTGAAGAGGAACAAATCCAACITrTCT 
TTTC AG A C ATCC AACC ATT ACCAG CTTCT A AAGTCTT AGTCC A ACTGG AAAAAG AAAATTTGTC C ATTG ACG A CCG 
AGTTTTGGTTCAAAAACTACAACTAACTGTCCGTGGCCAAGAAAAAATCGGTATTATCGG 
. GCC AAAT GGTGTTGGGAAATCAACTCTGTTAGCCAAGTTACAGAGACTTCrGAATGATAAAAGAGAGATTTCACT 
TGGTTTTATGCCACAAGATTACCACAAAAAACTGCAATTGGATTTATCCCCAATAGCCT
GAAAAAGAGGAACTACAGAAAATCCAATCTCACCTAGCTAGTCTCAATTTCAGTTATCCAGAAATGCAGCATCAA
ATTCGCTCCTTATCTGGCGGACAACAGGGAAAACTCCTGCTTTTGGATTTAGTCCTGCG 
TGCTGGATGAACCCACACGAAACTTTTCTCCCACTTCTCAACCCCAAATCAGAAAACTCTTTGCT 
CGGTCTCATCACTGTTTCGCATGACCGTCGTTTCTTAAAAGAAGTCTGCTCGATCATCTATCGCATGACAGAACAC 
GGTTTGAAGCTAGTTAATTTAGAAGATTTATAA

4179.4 

ATGAAACCAAAAACATTTTACAACTTGCTTGCCGAGCAGAATCTTCCA 
GAACGTTATTTTGAGCTCrTGGTCGAGTGGAATGAGAAGATTAATTTGACGGCGATTACGGA 

TATCTCAAACATTTTTACGATTCGATTGCACCCATTCTTCAAGGTTTGATTCCCAATGAA^

TATCGGGGCTGGGGCAGGATTTCCTAGTCTACCAATGAAAATTCTCTATCCGGAGTTAGATGTGACCATTATTGAT 
TC ACTCAAT AAGCG C ATC AACTTC CT AC AACTCTTGGCTC AAG AACTGG ATTTG AACG G AGTTC ATTTCT ACC ACG 
G ACGTGCCG AAG ATTTTG CCC AAG A C AAG AACTTCCGTG CTC AAT ATG ATTTTGT AAC AG CTCGTG CG GTTG CCCG 
TATGCAGGTCCTATCTGAATTGACTATTCCCTACCTTAAGGTTGGTGGCAAACTATTAGCACTCAAGGCTAGCAAT 

15 GCGCCTGAGGAATTATTAGAAGCTAAGAATGCCCTCAATCTCCTTTTTAGTAAGGTCGAAGACAATCT^ 

CCCTACCGAATAGAGATCCGCGCTATATCACAGTGGTAGAAAAGAAAAAAGAAACACCAAATAAATATCCACGTA 
AGGCTGGTATGCCAAATAAACGCCCACTTTAA 

4179.6 

20 ATGAGTATTAAACTAATTGCCGTTGATATCGACGGAACCCTTGTCAACAGCCAAAAGGAAATCACTCCTGAAG^ 
TTTCTGCCATCCAAGATGCCAAAGAAGCTGGTGTCAAAGTCGTGATTGCAACTGGCCGCCCTATCGCAGGCGTTGC 
CAAACTTCTAGACGACTTGCAGTTGAGAGACGAGGGGGACTATGTGGTAACCTTCAACGGTGCCCTTGTCCAAGA 
AACTGCTACAGGACATGAGATTATCAGCGAATCCTTGACTTATGAGGATTATCTAGATATGGAATTCCTCAGTCGC 
AAGCTCGGTGTCCACATGCATGCCATTACCAAGGACGGTATCTATACTGCAAATCGCAATATCGGAAAATACACT 

25 GTACACGAATCAACCCTCGTCAGCATGCCTATCTTCTACCGTACCCCTGAAGAAATGGCTGG CAAA GAAATTGTTA 
AATGTATGTTTATCGATGAACCAGAAATTCTCGATGCTGCGATTGAAAAAATTCCAGCAGAATTTTACGAGCGCTA 
CTCCATCAACAAATCTGCTCCTTTCTACCTCGAACTCCTTAAAAAGAATGTAGACAAGGGTTCAGCCATT 
TTGGCTGAAAAACTCGGATTGACCAAAGATGAAACCATGGCAATCGGTGATGAAGAAAATGACCGTGCCATGCTG 
GAAGTCGTTGGAAACCCCGTTGTCATGGAAAATGGAAATCCAGAAATCAAAAAAATCGCCAAATACATCACCAAA 

ACAAATGACGAATCCGGCGTTGCCCATGCCATCCGAACATGGGTACTGTAA

4179.7 

ATGACTTGGATTATTCTTGGAGTTATCGCTCTTATTGTTATTTTTGTGATTGTTAGCTA 

TCGTATGCAAACCAAGGAGGCTTGGAGTCAGATTGATGTTCAGTTGAAACGTCGCAATGACCTCTTGCCAAACTTG 
35 ATTGAGACTGTAAAAGGTTATGCCAAATATGAAGGTTCTACCCTTGAAAAGGTGGCAGAACTACGTA ACCAA GTG 
GCGGCAGCGACrrCACCAGCAGAAGCTATGAAAGCCAGTGATGCCCTCACTCGTCAGGTTTCAGGTATTTTTGCAG 
TTGCAGAAAGCTATCCAGATTTGAAAGCTAGTGCTAACTTTGTTAAATTGCAAGAGGAGTTGAC 
ATAAAATTTCTTACTCTCGTCAACTCTATAACAGTGTTGTCAGCAACTACAATGTAAAATTAGAA^ 
CAATATTATCGCTGGAATGTTTGGATTTAAAGCGGCAGATTTCCTTCAAACACCTGAAGAGGAAAAGTCGG 
AAAGTTGATTTTAGCGGTTTAGGTGACTAA

4179.8 

ATGTTGTTTGATCAAATTGCAAGCAATAAACGAAAAACCTGGATTTTGTTGCTGGTATTTTTCCTACT 
TGTTG GTT ATG CG GTTGGTT ATCTCTTT AT A AG ATCTGG A CTTGGTGGTTTG GTT ATTGC ACTG ATT ATCGGCTTTA 
45 T CT ACG CTTTGTCT ATG ATTTTTC AATCG AC AG AG ATTGTC ATGTCC ATG AATGG AG CG CGTG AGGTGGATG AG C A 

AACGGCACCAGACCTCTACCATGTAGTGGAAGATATGGCTCTGGTCGCTCAGATTCCTATGCCCCGTGTTTTCATC 
ATTGATGATCCAGCCTTAAATGCCTTTGCGACAGGTTCTAATCCTCA^^ 

TACTAGCTATCATGAATCGTGAAGAACTAGAAGCTGTTATGGGACATGAAGTCAGTCATATTCGTAATTATGATAT 
CCGTATTTCGACTATTGCAGTTGCCCTTGCTAGTGCTATCACCATGCnTCTAGTATGGCAGGTCGTATGATGTGGT 
50 GGGGTGGAGCAGGTCGCAGACGAAGTGATGATGACCGAGATGGAAATGGTCTTGAAATCATTATGCTAGTGGTTT 
CCCTACTAGCTATTGTACTGGCACCTCTCGCTGCAACCITGGTTCAGCTCGCTATTrCT 

GCAGATGCATCTAGTGTCGAGCTGACTCGCAATCCCCAGGGAATGATTAATGCCCTAGATAAGTTGGACAATAGC 
AAACCTATGAGTCGCCACGTCGATGATGCTAGCAGTGCCCITTATATCAATGATCCTAAGAAAGGTGGGGGGTTC 
CAAAAACTCITTTATACCCACCCACCTATCTCAGAACGGATTGAACGTTTAAAACAGATGTAA 


4179.9 

ATGAAATTAAATATTCAAGAAATTCGTAAGCAGTCTGAAGGTTTGAACTTrGAACAAACGTTAGATTTAGT^ 
ACCTGCGTGCACGTAATCAAGAAATTTTAGATGTAAAAGATATCCTTGCAGTTGGGAAAGTACAATATGAAGACC 
GTATGTATTTCTTAGATTATCAACTATCTTATACCATTGTTCTTGCTTCGAGTCGCAGTATGGAGCC^ 
60 GTTGAATCTTATCCAGTCACGGAAGTTTTCATGGAAGGCGCAACTAACCAGCTAGATCAAGAAGTTTTAGATGATG 
ACrTGGTCTTGCCCATCGAAAATGGGGAGCTTGACCTTGCTGAGAGTGTATCAGACAATATCCTGCTAAACATTCC 
TATCAAGGTCTTGACGGCTGAAGAAGAAGCTGGTCAAGGATTTATCTCAGGAAATGACTGGCAAATCATGACAGA 
GG A AG A AT ACC AAG CTC AAAAAG C AGT A A AG A A AG AAG AAA A C AGTCCTTTTG CTGG CTT AC A AGG ACT ATTTG A 
CGGAGATGAATAA 

4179.12

ATGGAGTTATTTATGAAAATCACAAACTATGAAATCTATAAGTTAAA^ 

TGAAAGTGCTAGAATACGGTGAAAATGTTGATCAGGAGCTTTTGTTGGGTGATATTGCAGATATCTCAGGTTGCCG 
TAATCCAGCCGTTTTTATGGAACGTTATTTTCAGATAGACGATGCGCATTTGTCGAAAGAGTTTC 
5 TCTTrcrCTATTTTAGATGACTGTTATCCTTGGGATTTGAGTGAAATATATGATGCGCCT 

GGGAAATCTTGACCTCCTGAAATTCCCGAAGGTAGCGGTCGTGGGCAGTCGTGCTTGTAGCAAACAGGGAGCTAA 
GTCAGTTGAAAAAGTCATTCAAGGCTTGGAAAATGAACTGGTTATTGTCAGTGGTCTGGCCAAGGGCATT GACA C 
AGCAGCTCATATGGCAGCTCTTCAGAATGGCGGAAAAACCATTGCAGTGATTGGAACAGGACTGGATGTGTTTTA 
TCCTAAAGCCAATAAACGCTTGCAAGACTACATCGGCAATGACCATCTGGTTCTAAGTGAATATGGACCTGGTGA 

ACAACCTCTGAAATTTCATTTTCCTGCCCGTAATCGCATCATTGCT

GCTAAGATGCGTTCAGGTAGTCTCATTACGTGTGAGCGAGCAATGGAAGAAGGACGCGATGTCTTTGCTATTCCTG 

GTAGCATTTTAGATGGACTATCA GACG GTTGCCATCATTTGATTCAAGAAGGAGCAAAATTGGTCACCAGTGGGC 

AAGATGTTCTTGCGGAATTTGAATTTTAA 

4181.1

ATGAAACGTCAATTAGCCITGGTCGTCTTTAGTGGTGGTCAAGATTCAACAACCT 

ACTATGAAACAGTCGAAGCTGTCACCTITGCCTACGGCCAACGTCATCACCTCGAAATTCAAATTACTAGAG AAAT 
CGCTAAGGAACAGGGCATTCGTCACCATATCCTCGATATGTCTCTGCTGGGACAAATCACTGCTCAGCCAGACTTT 
GCGACGATTCATATTTCCTACATTCCTGACAAGCTCTGTGTC 
20 ACCGAAACCACGGAGATTTCCACGAAAACTGTATCAACACCATCGGGAAAGACTTGGTCAACTTGCTAGACCCTC 
GCTATTTAGAAGTCTGGGGAAAATTCACTCCGCGCGGTGGCATTTCAATCGACCCCTACTACAACTACGGTAAGCA 
AGGAACTAAGTATGAGGGCTTGGCAGAACAACGCCTCTTCCAACACGACCTTTATCCAGAGAAAATTGACAACCG 
CTAA 

4181.2

ATGACCGAAACGGTAGAAGATAAAGTAAGTCATTCAATTACTGGGCTTGATATCCTCAAGGGGATAGTTGCTGCG 
GGAGCTGTCATAAGTGGAACCGTTGCAACTCAAACGAAGGTATTTACAAATGAGTCAGCAGTACTTGAAAAAACT 
GTAGAGAAAACGGATGCTTTGGCAACAAATGATACAGTAGTTCTAGGTACGATATCTACAAGTAATTCAGCGAGT 
TCAACTAGTTTGTCAGCTTCAGAGTCGGCAAGTACATCTGCATCTGAGTCAGCCTCAACCAGCGCTTCGACCTCAG 
30 GAAGTACAAGTGCATCAGAATCAGCAAGTACATCGGCTTCGACAAGTATTTCTGCATCATCTACTGTGGTAGGTTC 
ACAAACAGCTGCCGCTACAGAAGCAACTGCTAAGAAGGTCGAAGAAGATCGTAAGAAACCAGCTAGTGATTATGT 
AGCATCAGTTACAAATGTCAATCTCCAATCTrATGCTAAGCGACGCAAGCGTTCAGTGGATTCCATCGAGCAATTG 
CTGGCTTCTATAAAAAATGCTGCTGTTTTTTCTGGCAATACGATTG 

TAAACATTGCTAAAAGTGAGACAAAAGTTTATACAGGTGAAGGTGTAGATTCGGTATATCGTGTTCCAATTTACTA 

35 TAAATTGAAAGTGACAAATGATGGTTCAAAATTGACCTITACCTATACGGTTACGTATGTGAATCCTAAAACA 

GATCTTGGTAATATATCAAGTATGCGTCCTGGATATTCTATCTATAATTCAGGTACTTCAACACAAACAATGTTAA 
CCCTTGGCAGTGATCTTGGTAAACCTTCAGGTGTAAAGAACTACATTACTGACAAAAATGGTAGACAGGTTC TATC 
CTAT AATAG\TCT ACAATG ACG ACGCAGGGT AGTGGGT AT ACTTGGGG A AATGGTGCCCAAATGAATGG I" r I L 1 1 1 
GCTAAGAAAGGATATGGATTAACATCATCTTGGACTGTACCAATTACTGGAACGGA 

40 TACATCCTTTACATTTACCCCTTACGCTGCTAGAACAGATAGAATTGGAATTAACTACTTCAATGGTC 

GTAGTTGAATCTAGCACGACCAGTCAGTCACTTTCACAGTCTAAGTCACTCTCAGTAAGTGCTAGTCAAAGCGCCT 
CAGCTTCAGCATCAACAAGTGCGTCGGCTTCAGCATCAACCAGTGCCTCGGCTTCAGCGTCAACCAGTGCGTCAG 
CTTCAGCAAGTACCAGTGCTrCAGTCTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACAAGTGCCTCGGCTTC 
AGCAAGCACATCAGCATCTGAATCAGCGTCAACCAGTGCTTCGGCTTCAGCAAGTACCAGTGCTTCAGCTTCAGC 

45 ATCAACCAGCGCCTCGGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGCGCCTCGGCCTCAGCAAG 
C ACCTC AGCTTCTG AATCGGCCTC AA CC AG CG CCTC AG CCTC AG C ATC AACG AGTGCTTCGG CTTC AG CAAG C AC 
AAGCGCCTCGGGTTCAGCATCAACGAGTACGTCAGCrTCAGCGTCAACCAGTGCTTCAGCCTCAGCATCAACAAG 
TGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCTGAGTCAGCATCAACGAGTAC 
GTCAGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGTGCGTCAGCCTCAGCATCGACAAGCGCCTC 

50 AGCTTCAGCAAGTACCAGTGCTTCAGCCTCAGCGTCGACAAGTGCGTCGGCCTCAACCAGTGCATCTGAATCGGC 
ATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCATCGGCTTCAGCATCAACCAGTGCCTCGGCTTCAGCGTCA 
ACCAGTGCGTCAGCTTCAGCAAGTACCAGTGCTTCAGTCTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACA 
AGTGCCTCGGCTTCAGCAAGCACATCAGCATCTGAATCAGCGTCGACAAGCGCCTCAGCTTCAGCAAGTACCAGT 
GCGTCAGCTTCAGCATCAACC^GCGCCTCGGCCTCAGCAAGCACCTCAGCTTCrGAATCGGCCTCAACCAGCGCC 

55 TCGGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGCGCCTCAGCCTCAGCATCAACGAGTGCTTCG 
GCITCAGCAAGCACAAGCGCCTCGGGTTCAGCATCAACGAGTACGTCAGCTTCAGCGTCAACCAGTGCTTCAGCC 
TCAGCATCAACAAGTGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCTGAGTCA 
GCATCAACGAGTACGTCAGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGTGCGTCAGCCTCAGCA 
TCGACAAGCGCCTCAGCTTCAGCAAGTACCAGTGCTTCAGCCTCAGCTCGACAAGTGCGTCGGCCTCAACCAGTG 

60 CATCTGAATCGGCATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCATCAGCTTCAGCATCAACGAGTGCAT 
CGGCTTCAGCATCAACCAGTGCCTCGGCTTCAGCGTCAACCAGTGCGTCAGCTTCAGCAAGTACCAGTGCTTCAGT 
CTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACAAGTGCcTCGGCTTCAGCAAGCACATCAGCATCTGAATCA 
GCGTCG AC AAG CG CcTC AG CTTC AG C AAGT A CC AGTGCGTC AG CCTC AGCGTCG AC A AGTG CGTC AGCCTC AGC A 
AGTACTAGTGCATCAGCTTCAGCATCAACGAGTGCATCGGCTTCGGCGTCAACCAGTGCATCAGAGTCAGCAAGT 

65 ACCAGTGCGTCAGCTTCCGCATCAACAAGTGCCTCGGCTTCAGCAAGCACCAGTGCGTCGGCTTCAGCAAGTACT 
AGCGCCTCAGCCTCAGCCTCAACCAGTGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGT
GCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCGTCAACAAGTGCATCGGCTTCAGCGTCAACGAGTGCG 
TCTGAATCGGCATCAACGAGTGCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCGTCAACAAGTGCATCG 
GCTTCAGCATCAACGAGTGCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCGTCAACAAGTGCATCGGCTT 
5 CAGCGTCAACGAGTGCGTCTGAGTCAGCATCAACGAGTGCGTCAGCCTCAGCAAGCACATCAGCTTCTGAATCTG 
CATCAACCAGTGGGTCAGCCTCAGCATCGACAAGCGCCTCAGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCGT 
CGACAAGTGCGTCGGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCAAGTACCAGTGCGTCAGCCTCAGCGTCGA 
CAAGTGCGTCGGCCTCAACCAGTGCATCTGAATCGGCATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCAT 
CAGCTTCAGCATCAACGAGTGCATCGGCTTCAGCATCAACCAGTGCATCAGAGTCAGCAAGTACCAGTGCGTCAG 
TTCCGCATCAACAAGTGCCTCGGCTTCAGCAAGTACTAG

4183.1 

ATGGGGGTCGAAACTTGGTTTTATTCTAGCATCTGCTGGCTGGCCATCGGGCTTG 
ACATGACTGCTGCTAATGGCGGTGGAGGCTTTTTACTAATCTTTCTCAmCCACTA 
CTGCTGGCTGAGTTTGCCCTTGGCCGTAGTGCTGGCGTTTCCG

ACAAGTACAACTrTATCGGTTGGATTGGCGCCTTTGCCCTCITTATCCTC^ 

TGGATTCTAGTCTATCTAGGTATTGAGTTTGGGAAATTGTTCCAACTTGGTGGAACGGGTGATTATGCTCAGTTAT 
TTACrrCAATCATTTCAAATCCAGCCATTGCCCTAGGAGCTCAAGCGGCCTTTATCC TATT GAAT ATCT^ 
TCACGTGGGGTTCAAAAAGGGATTGAAAGAGCITCGAAAGTCATGATGCCCCTGCTCTTTATCGTCm 
20 TCATCGGTCGCTCTCTCAGTTTGCCAAATGCCATGGAAGGGGTTCTTTAOT 

GACTAGCACTGGTCTCCTCTATGCTCTGGGACAATCTTTCTTTGCCCTCTCACTAGGGGT^ 

ATGCTTCTTACTTAGACAAGAAAACCAATCTAGTCCAGTCAGGAATCTCCATCGTAGCCATGAATATCTCGATATC 
CATCATGGCAGGTCTAGCCATTTTCCAAGCTCGATCCCCCTTCAATATCCAGTCTGAAGGGGGACCCAGCCTGCTC 
TTTATCGTCTTGCCTCAACTCTTTGACAAGATGCCTTTTGGAACCATTTTCTACG 
25 TTTGCGACAGTCACTTTTTCTGTCGTGATGCTGGAAATCAATGTAGACAATATCACCAACCAGGATAACAGCAAAC 

GTGCCAAATGGAGTGTTATTTTAGGAATTTTGACCTTTGTCTTTC 
GCGGATGTTCACATTTTTGGTAAGACCTTCTTTGACGCTATG 
AGCTCTCTACCrrrCACTTTTTACAGGCTATATCTTTAAAAAGGCTCI^ 
AGAGCATGGAAACAAGGACTGTTCCAAGTCTGGCTCTTCCTTCTTCGTTTCTTCGTTTCGTCA 
ATTGTGGTCTTCATTGCCCAATTTATGTAATCAAAAAGGACTTGAGTAG

4183.5 

ATGTTG AAAAAATGG C AGTT A AAAG ATGTT ATCTTGCTTGCTTTCTTGTCT ATCTTTTTTGGTG G GGTTTTCGTTGG 
TTCAGGATATGTGTATAATATTCTCAGTCTACTCTTAACACCTCTTGGTTTGCAGGCCTTTGCCAATGAAAT 

35 TCGGTCTCTGGTGTATGGCTGCGCCCATTGCTGCCATCTTrGTTCCGAGAGTCGGAAGTGCA ACGAT TGGAGAAGT 
GCTAGCTGCGCTTGCTGAAGTCCTTTATCGTAGCCAATTTGGTCT^ 

TGGGAAGTGAATTTGGTTTTATCGTAACTAAGAATCGCTATGAAAGTTGGCTCTCTCTAACTGCT 
GATTACGCTTGTTAGCTTTGTCTATGAATACATTAAGTTAGGTTACTACGCCTTTTCCC^ 
GCrTGTGGTACGTTTTATTTCTGTTTATTTCTTCTGTACCATCTTGGTTCGTGCCATTGTCAAACT 
TGCAACTGGAGGAAAAGCATAG

4183.6

ATGGTCAAAGTAGCAACCCAGACACCGATTATCAGTCTCrrrCTTGCTGATTT^ 
TTCGATTGCTCTGACTCTTTCGGTAGTCGCATTTTGTATTCTCTTTATGCTCTATTACCGTCGATT^ 

45 CTTGGATGATCATACTTGCCATTTTACCATCTTTTGCCAACTACTGGGCAGTTCAGTTACACGGAGATGCTTCACA 
GGCAGTCATGCTTGGAACGAGGGCCTTTGTGACAGTTTGTATCGGCCTTGTCITrGTTTCCTCT^ 
AGCTTCTCTTGTACTTGGCTCAAAAGGGG CTATC ACGCTCTTGGTCCTATGCCTTG ATTGTGGTAT TCAAT TC 1 1 1 1 
CCTCTCATTCAGCAAGAAATCAAGTCCCTCAAAGAAGCTTGCCTATTACGTGGTCAAGAACTACATTTTTGGTCGC 
CCTTGATTTACAGTAAGGTTCTGATGACAGTCTTTAGGTGGCGCCATCTTTACCTGAGAGCTCTATCTGCTCACGG 

50 ATATG ACGAACATGCACAGTTGAAGAAT AGCTAT CGGACTTTTTATATTCCTAAAAAAACAAAATTAATCTACCTG 
CTTTTCTTTTTATTG CTTCA AACC AGTCT ATTTTT AT A A 

4183.7 

ATGAGAAAGCACCAATTACAAGTTCACAAATTAACCATTTTATCTATGATGATTGCCCTTGATGTAGTCCTTACAC 
55 CTATCITrCGAATTGAGGGAATGGCACCGATGTCCAGTGTAGTCAATATTCTAGCAGGAATCATGATGGGACCTGT 
TTATGCCTTGGCTATGGCTACAGTCACAGCCTTTATCCGTATGACGACTCAAGGGATTCCGCCTTTAGCTCTCACA 
GGAGCGACnTTTGGAGCCCrrCTAGCAGGTCTCTTTTATAAGTACGGTCGAAAATTTCA 
AGATTTTGGGAACAGGTATTATTGGTTCCATTGTTTCCTATCCTGTTATGGTACTCTTTACAGGATCAGCT G 
CTTAGCTGGTTTATCTACACGCCTCGATTTTTCGGAGCAACCTTGATTGGTACAGCGATTTCCTTT 
60 ATTTTTAATCAAGCAGGAATTCTTTAAAAAAGTGCAGGGATA rr TCl' rr AGTGAAAGGATAGACTGA 

4183.8 

ATGCAGGAATTTACAAATCCCTTTCCTATAGGCTCTAGTTCCCTCATTCACTGCATTACCAATGAGATTTCTT 
GATGCTGGCAAATGGGATTTTGGCTCTGGGATGCAAACCTGTCATGGCAGATGATTCCCGTGAAGTTCTTGATTTT 
65 ACTAAGCAAAGTCAGGCTCTCTTCATCAATTTGGGGCATTTGTCAGCTGAGAAGGAAAAAGCAATCCGCATGGCA 
GCTTCGTATGCAAACCAATCTrCn'CTCCCGATGGTAGTAGATGCGGTTGGCGTAACGACTTCATCCATTCGTAAGA

GCTTAGTTAAAGACCTTTTAGACTATAGACCTACGGTCCTTAAAGGAAACATGTCAGAAATT 

ATTAAAGCACCACGGCGTTGGGGTCGATGCGAGTGCTAAAGATCAAGAAACGGAGGATTTGCTTCAAGTCTTGAA 

AGACTGGTGTCAGACCTATCCTGGTATGTCTTT<nTAGTCACAGGTCCCAAGGACCTCGTCGTTTCGAAAAATCAG 

GTCGCTGTACTGGGAAATGGCTGTACTGAATTAGACTGGATAACAGGGACAGGAGACTTGGTTGGAGCCTTAACA 

GCTGTTTTTCTCAGCCAAGGAAAGACTGGTTTTGAAGCTTCTTGCT^ 

GAAAATAGTTGTTCAAGGAATGGGATTGGAAGAATTTCGTTACCAAGTACTCAATCAGCTTTCGCTCCTAAGAA^ 
AGATGAAAATTGGCTAGATACCATCAAAGGAGAGGTTTATGAATAG 



4185.3 

ATGAACCATAAAATCGCAATTTTATCAGATGTTCATGGCAATGCGACGGCGCTAGAAGCAGTGATTGCAGATGCT 
AAAAATCAAGGGGCCAGTGAATATTGGCTTCTGGGAGATA'IMMM'rCrrCCTGGTCCAGGCGCAAATGACTTAGTCG 
CCCTGCTAAAGGACCTTCCTATCACAGCAAGTGTTCGAGGCAATTGGGATGATCGTGTCCTTGAGGCTTTAGATGG 
GCAATATGGCTTAGAAGACCCACAGGAAGTTCAGCTCTTGCGTATGACACAGTATTTGATGGAGCGAATGGATCC 
TGCAACGATTGTCTGGCTACGAAGCTTGCCTTTGCT^ 

CATAATTTACCTGACAAAAACTATGGTGGTGACTTGCTAGTTGAGAATGATACAGAGAAATTTGACCAACT 
GATGCGGAAACGGACGTGGCAGTTTATGGTCATGTTCACAAGCAG'TTGCTTCGTTATGGAAGTCAAGGGCAACAA 
ATC ATC AATCC AG G GTCG ATTGG C ATG CCCT ATTTT AATTG GG AGGCGTT AAA AAATCACCGTTCCC AGT ATG CCG 
TGATAGAAGTTGAAGATGGGGAATTACTCAATATCCAATTTCGTAAAGTTGCTTATGATTACGAAGCTGAGTTAGA 
ATTG GCC AAGTC C A AGGGGCTTC C CTTT ATCG AAATGT ATG AAG AACTGCGTCGTG ACG AT AACT ATC AGGGG C A 
C AATCTGG A ATT ATT AGCC AG CTT AAT AG AAAAG C ATG GGT ATGT AG AGG ATGTG A AG AATTTTTTTG ATTTTTTG 
TAA 

4186.1 

ATGAATGTAAATCAGATTGTACGGATTATTCCTACTTTAAAAGCTAATAATAGAAAATTAAATGAAAC^ 

TTGAAACCCTTGGAATGAAGGCCTTGTTAGAAGAATCGGCCTTTCTGTCACTAGGTGACCAAACGGGT 

GCTGGTTTTAGAAGAAGCTCCCAGTATGCGTACTCGTAAGGTAGAGGGAAGAAAAAAACTAGCTAGATTGATTGT 

C AAGGTG G AAAATCCCTTAG AAATTGAAGG AATCTT ATCT AAAAC AG ATTCG ATTC ATCG ATT AT AT AAAGGTC A 

AAATGGCTACGCTTTTGAAATTTTCTCACCAGAAGATGATTTGATTTTGATTCATGCGGA 

CTAGTAGAAGTAGGAGAAAAGCCTGAATTTCAAACAGATTTGGCATCAATTTCITrAA 

TGG AATT ACATCTCCC AACTG AT ATCG A AAG II TC 1"1 G G AATCATCTG AAATTG G GGC ATCCCTTG ATTTT ATTCC 
AGCTCAGGGGCAGGATTTGACTGTGGACAATACGGTTACCTGGGACTTATCTATGCTCAAGTTCITGGTCAATGAA 
TTAGACATAGCAAGTCTTCGCCAGAAGTTTGAGTCTACTGAATATTTTATTCCn*AAGTCTGAAAAATT 
GT AA AG AT AG AAAT AATGTTG AATTGTGGTTTG AAG AAGT ATG A 

4186.2 

ATG AAGTGGACCAAGATT ATT AAAA AAAT AGAAGAACAAATCGAGGCAGGGATTTATCCCGGAGCCTCTTTTGCG 

TATTTTAAGGACAATCAATGGACAGAGTTCTATTTAGGCCAGAGTGACCCAGAGCATGGCTTGCAGACTGAGGCA 

GGACrAGTTTATGACCTAGCTAGTGTCAGCAAGGTTGTTGGGGTTGGCACAGTTTGTACCnTCTTGTGGGAAATAG 

GTCAATTAGATATTGATAGACTGGTAATAGATTTTTrACCrGAGAGTGATTATCCAGACATCACTATTCGCCAGCT 

CnTGACTCATGCAACAGACCITGATCCrrTTATTCCTAATCGTGATCITITAAC 

ATGTTTCATCTCAACAGACGAAGTCAGCCAGCCmCTTTATTCGGATGTCCATTTm 

GGAAAGAATTTTTAATC AAG ATTTGG ATGTG ATTTT AAAGG ATC AAGTCTGGAAACCTTGGGGAATG 

TAAGTTTGGGCCAGTTGAGCTTGCTGTTCCAACAGTTAGAGGTGTAGAGGCAGGCATAGTGCATGATCCCAAGGC 

TCGTCTCCTGGGTAGACATGCTGGGAGTGCTGGTTTATTTTCGACTATAAAGGATTTACAAATCT^ 

TATTTAGCAGATGATTTTGCAAGAGACITAAATCAAAATTTTTCrCCT^ 

GGAATTTGGAAGGAGATTGGCTAGACCATACGGGCTATACAGGTACCTTTATCATGTGGAATCGTCAGAAGCAAG 
AAGCCACTATTTTCCTATCGAATCGTACCTATGAAAAGGACGAGAGAGCTCAATGGATATTAGACCGCAATCAAG 
TGATGAACTTGATTCGCAAAGAAGAGTAA 

4187.2 

ATGATGAAGAAGACnTATAATCATATTTTGGTCTGGGGAGTCATTTTCTATAGCAT^ 
TACTCCTCAAGAACAATCTACCGTGGGAGTGGGAACrCCAGGTATTCAGCATCTTGGACGCCTGGTTm 
ACTCCnTTCAATTCTCTCTGGAAACTGGGCGAAGTGAGTGACATTGGACAATTATGTTGGA ITlTnTACAAAATA 
TCCTCAATGTCTTCTTGTTTTTTCCT 

GTCCn un ri l AG III I Cl rGTGAGTCTTGGAATCGAGTGTACGCAATTAATCTTGGACrn i l C 11 1GATTTCAAT 
CG CGTCTTTG AG ATTG ATG ATTTGTGG ACC A AC ACTTTG G GTG GCT ATCTGG CTTGG CTCCTTT AT AAACG ATT AC 
AT AAA A AC AAG GT AAG G A ATT AA 
4188.1

ATGAAGATTCCTCTCTTAACTTTTGCAA^ 

TATCGTGATGTTTTGATGACTTATTTCTTTTTTGATATTCATGCGCCCGATCT 
TAAAAATGACTTATTAAAATCAGCATTAGATTTTCGTATTCT 
TTCCAATCATCATTGTTTTGCTAGGTTTTCAATATATTGAGCTGAAAAATAAAGTTTTACGATTGAGT
AGAAGTGAGTTATCAAGGGTTAAAAAGAAAGTTGACTTTGCAAGTTGCAAGTATCCCTTGTTTGATA T 
ACTGTGCTGATAATTGCAATTATAACCTATTTCTTTGGGACTTmCTCCT 
GGAAGTGGTITACAAAGACrrCCTAGATGGAGAGATAAAAAGCTATTTGTTCrTTAOT 
TTTTCATCAATGCAATCTATTTTTTACAAATAGTTGATTATGTGGGGAATGTGACT 
10 ATGTTTCTTTGGCTTGGTTCTATGCTGCTTTATAGTGCCnTGCCnTACTA 

AGCTAGCTATGGGGATGTAAGTTTGATGAAACTCTTTACTCCTTATATCCTITATATTGTCCCT 
AAAAAT ATG A AG AT AATGTTT AA 

4188.2 

15 ATGAAGAT AATGTTT AAGAATTrTAACAATATTTTGCTAAATAGAAAGATTGTTTT ACT 

GATGATTTTGATAAACCATCTATTGTCAACAGCGGTTCAAAAGCAGGATGCT 
ATTTCAATTTTTTCCTATAATGACTATTCTGAAGCGAATTTAGAAATCCCCAAACT 

C ATGGT AGG ATG G CTCTCTGT C ATTTT ACTTG AAAGTG ATTTGG C AG ACC ATTACC ATC ACTTG ATTCG CT ATC AA 
TCAAGCTCCTTTTTCGATTATACAAGGAAACGATTGGTTGTCATre 
20 GTTTCTTGGTrrACTTCCTCTAGGAATTCATTrCAAAACAGTCGCACr^ 

GTACTTACTACTGTCrTATCTGATAGCACTGATTAGTGCGGGCGCTGGTTTTTCCTTTT^ 

TGTGGGACAAGAATGGATGATGGATCATATTGTAACAGTGTATTTAGTACTCTTAAGTTTATTAGTTATGTTGAT^ 
GTT AGTCG CTTGG A AG AG AA ATTT AAG AAAG G AT A A 



ATGGGCAAAGGAGAGATGGGCAAAGGAGTTATTGGCnTGGAGTTCGACTCAGAAGTATTGGTCAACAAGGCTCCA 
ACCCTTCAATTGGCAAATGGTAAAACAGCGACmCCTAACCCAGTATGATAGCAAGACCTTGTTGm 
ATAAGGAAGATATCGGACAGGAAATTATTGGTATAGCTAAAGGAAGCATCGAAAGTATGCATAATCTTCCTGTAA 
ATCTAGCAGGTGCCAGAGTTCCTGGCGGAGTAAATGGTAGCAAAGCAGCGGTGCATGAAGTTCCAGAATTTACAG 
30 GGGGAGTTAATGGTACAGAGCCAGCTGTTCATGAAATCGCAGAGTATAAGGGATCTGATTCGCTTGTAACTCTTAC 
T AC AAAAAAAG ATT AT ACTT ACAAAC CTCCTCTTG CTC AG C AGG C ACTTCCTG AAAC AGG AAAC AAGG AG AGTG A 
CCTCCTAGCTTC ACT AGG ACT AAC AGCTTTCTTCCTTGGTCTGTTTACG CT AGG G A AAAAG AG AG AA C AAT AA 

4188.10 

35 ATGTTTAAAGTTTTACAAAAAGTTGGAAAAGCITTTATGTTACCTATAGCTATACTTCCTGCA 

GGGGATTGGTGGTGCACTTTCAAACCCAACCACG ATAGCAACTTATCCAATACTAG ACAATAGTA 1 1 1 1 1 CAATCA 
ATATTCCAAGTAATGAGCTCTGCAGGAGAGGTTGTATTCAGTAATTTGTCACTACTTCTCTGTGTGGGATTATGTA 
TTGGCITAGCGAAACGAGATAAAGGAACCGCTGCGTTAGCAGGAGTAACTGGTTACITAGTTATGACTGCAACGA 
TCAAAGCTTTGGTAAAACTTTTTATGGCAGAAGGATCTGCAATTGATACTGGA GTTA TTG 

40 AATAGTTGCCCTATATTTGCACAACCGATATAACAATATTCAATTACCTTCCGCTTTAGGATTCTTTG GA 

CGCTTCGTTCCTATTGTT AC ATCGTTCTCTTCT ATCITG ATTGGCITTGTCrrC 1' n'GTTATTTGGC CACCT TTCCAA 
CAACinrinaiTTCTACAGGTGGATATATTTCTCAGGCGGGTCCAATTGGAACTTTTCTATATGGAl 
ACITTCTGGAGCAGTAGGCTTACATCATATAATTTACCCTATGTTTTGGTATACTGAACTTGGTGGTGTTGAAACT 
TTGCAGGACAAACAGTGGTTGGAGCTCAAAAAATATTTTTTGCTCAATTAGCCGATTTGGCCCATTCTGGATTATT 

TACAGAAGGAACAAGGTTTTTTGCAGGTCGTTTCTCAACAATGATGTTCGGTTTACCGGCT

TACCATAGTGTTCCTAAAAATCGTCGTAAAAAATACGCGGGTTTGTTT^ 1 1 lATTAC 

CGGTATTACAGAACCAATTGAATTTATGTTTCTATTCGTCAGTCCGGTTCTATATGTTGTTCACGCATTCOT 

GTGTTAGCrTCmATTGCAGACGTCTTAAATATTTCAATAGGAAACACATT^ 

TT ATTTGG AATTTTGCAGGGGAACGCTAAGACGAATTGGGTTCTTCAGATTCCATTTGGACT^ 

50 GTATTATATTATTTTTAGATGGTTCATTACTCAATTCAACGTTCTAACGCCAGGGCGAGGAGAAGAAGTAGATTCT 
AAAGAAATTTCTGAATCCGCAGATTCAACTTCAAATACTGCAGATTATTTAAAACAGGATAGCCTACAAATTATCA 
GAGCCTTGGGTGGATCAAATAATATAGAAGATGTAGATGCTTGTGTGACACGTTTACGTGTAGCTGTAAAAGAAG 
TTAATCAAGTTGATAAAGCACTITTAAAACAAATTGGTGCAGTTGATGTCITAGAAGTGAAGGGTGGCAT^ 
AATCT ATGGAGCAAAAGCAATCTT AT AT AAAAAT AGT ATT AATGAAATTTTAGGTGTAGATG ATT AA 


4188.11 

ATGAAATTTAGAAAATTAGCTTGTACAGTACnTGCGGGTGCTGCGGTTCriTGGTCTTGCTGCT^ 
GCGGAAGTAAAGATGCTGCCAAATCAGGTGGTGACGGTGCCAAAACAGAAATCACTTGGTGGGCATTCCCAGTAT 
TT ACCC A AG AAAA AACTG GTG ACG GTGTTG G A ACTT ATG AAA AATC A ATC ATCG AAG CGTTTG AA AAAGC A A ACC 

60 CAGATATAAAAGTGAAATTGGAAACCATCGACTTCAAGTCAGGTCCTGAAAAAATCACAACAGCCATCGAAGCAG 
GAACAGCTCCAGACGTACTCTTTGATGCACCAGGACGTATCATCCAATACGGTAAAAACGGTAAATTGGCTGAGT 
TG AATG A CCTCTTC AC AG ATG A ATTTGTT A A AG ATGTC AA C AATG A A A AC ATCGT AC A AG C A AGT AAAG CTGG AG 
ACAAGGCITATATGTATCCGATTAGTTCTGCCCCATTCTACATGGCAATGAACAAGAAAATGTTAGAAGATGCTGG 
AGTAGCAAACCTTGTAAAAGAAGGTTGGACAACTGATGATTTTGAAAAAGTATrGAAAGCACTTAAAGA CAAG GG 

65 TTACACACCAGGTTCATTGTTCAGTTCTGGTCAAGGGGGAGACCAAGGAACACGTGCCTTTATCTCTAACCTTTAT 
AGCGGTTCTGTAACAGATGAAAAAGTTAGCAAATATACAACTGATGATCCTAAATTCGTCAAAGGTCTTGAAAAA

GCAACTAGCTGGATTAAAGACAATTTGATCAATAATGGTTCACA^ 

GCCAACGGTCAAACATCTTACACAATCCTTTGGGCACCAGCTCAAAATGGTATCCA^ 

AGTAAGGTAGAAGTGGTAGAAGTACCATTCCCATCAGACGAAGGTAAGCCAGCTCTTGAGTACCTTGTAAACGGG 
5 TTTGCAGTATTCAACAATAAAGACGACAAGAAAGTCGCTGCATCTAAGAAATTCATCCAGTTTATCGCAGATGAC 
AAGGAGTGGGGACCTAAAGACGTAGTTCGTACAGGTGCTTTCCCAGTCC 

ACAAACGCATGGAAACAATCAGCGGCTGGACTCAATACTACTCACCATACTACAACACTATTGATGGATTTGCTG 
AAATGAGAACACITTGGTTCCCAATGTTGCAATCTGTATCAAATGGTGACGAAAAACCAGCAGATGCTTTG 
CCTTC ACTG AAAAAG CG A A CG A AAC AATC A AA A A AG CT ATG AA AC AAT AG 


4188.12 

ATGCAATCTACAGAAAAAAAACCATTAACAGCCTTTACTGTTATTTCAACAATCATTT^ 

TGTTCATCTTTCCATTCTACTGGATTTTGACAGGGGCATTCAAATCACAACCTGATACAATTGTTATT 

TGGTTCCCTAAAATGCCAACCATGGAAAACTTCCAACAACTCATGGTGCAGAACCCTGCCTTGCAATGGATGTGG 

15 AACTCAGTATTTATCTCATTGGTAACCATGTTCTTAGTTTGTGC 

AAAACGTTTCTATGGTCAACGCATTCTATTTGCTATCTTTATCGCTGCTATGGCGCTTC^ 
TACCATTGGTACGTATCGTCAACTTCATGGGAATCCATGATACTCTCTGGGCAGTTATCITGCCTTTGATTO 
GCCATTCGGTGTCTTCCTCATGAAACAGTTCAGTGAAAATATCCCTACAGAGTTGCTTGAATCAGCTAAAATCGAC 
GGTTGTGGTGAGATTCGTACCTTCTGGAGTGTAGCCTTCCCGATTGTGAAACCAGGGTTTGCAGCCC^ 

20 TTACCTTCATCAATACTTGGAATGACTACTTCATGCAATTGGTAATGTTGACTTCACGTAACAATTT 
CITGGGGTTGCGACCATGCAGGCTGAAATGGCAACCAACTATGGTTTGATTATGGCAGGAGCT 
TTCCAATCGTCACAGTCTTCCTAGTCTTCCAAAAATCCTTCACACAGGGTATTACTATGGGAGCGGTCAAAGGATA 
A 

4191.1

ATGAAAAAAACTTTTTTCTTACTGGTGTTAGGCnT 

ATAAACTCTTATCAAGGGGATTTGTATATTCATGCAGACAATACGGCAGAGTTTAGACAGAAGATAGTTTACCAGT 

TTGAGGAGGACTITAAGGGCCAAATCGTGGGACTTGGACGTGCTGGTAAGATGCCTAGCGGGTTTGACATTGACC 

CTCATCCAAAGATTCAGGCCGCGAAAAACGGTGCAGAACTAGCAGATGTGACTAGCGAAGTAACAGAAGAAGCG 

30 GATGGTTATACTGTGAGAGTCTATAATCCAGGTCAGGAGGGCGACATAGTTGAAGTTGACCTCGTCTGGAACTTA 
AAAAATTTACTTTTCCTTTATGATGATATCGCTGAATTAAATTGGCAACCTCTGACAGATAG 
A AAAGTTTG AATTTC ATGT AAG GGG AG A C AAGGGG GCTG AA AAACTCTTTTTCCAT AC AGGG AA ACTC AG 
AGGGAACGATTGAAAAGAGTAACCnTGATTATACTATCCGTTTAGACAATCITCCGGCTAAGCGTGGAGTTGAG^ 
G C ATG CCTATTGG CCTCGG ACCG ATTTTGCT AG CG CT AGGG ATC AG GG ATTG AAAG GG AATCGTTT AG AAG AGTTT 

35 AAT AAG AT AG AAG ACTCG ATTGTT AG AG AAAA AG ATC AG AGT AAAC AACTCGTT ACTTG GGTCCTCC CTTCG ATC 

CTTTC C ATCTCCTTGTT ATTG AGTGTCTGCTTCT ATTTT ATTT ATAG AAG AAAG ACCACTCCTTCAGTCAAAT ATG C 
CAAAAATCATCGTCTCTATGAACCACCAATGGAATTAGAGCCTATGGTTTTATCAGAAGCAGTCrACTCGACCTCC 
TTGGAGGAAGTGAGTCCCTTGGTCAAGGGAGCTGGAAAATTCACCTTTGATCAACTTATTCAAGCTACC^ 
ATGTGATAGACCGTGGGAATGTCTCTATCATTTCAGAAGGAGATGCAGTTGGTTTGAGGCTAGTAAAAGAAGATG 

GTTTGTCAAGCTTTGAGAAAGACTGCCTAAATCTAGCTTTTTCA

TGCGGATTACAAGGTATCTGATAGTCnTrATCGTAGAGCCAAAGTTrCTGATGAAAAACGGATTCAAGCAAGAGG 
GCTTCAACTCAAATCTTCTTTTGAAGAGGTATTGAACCAGATGCAAGAAGGAGTGAGAAAACGAGT^ 
GGGCrCCCAGATTATTATCGTCCTTrAACTGGTGGGGAAAAGGCCTTGCAAGTGGGTATGGGTGCCTTGACTA 
TGCCCCTATTTATCGGATTTGGTTTGTTCTTGTACAGTTTAGACG7TCATGGCTATCTTTACCT 

CTTGGTTTTCTAGGGTTAGTTTTGTCTGTTTTCTATTATTGGAA

TGAAGCGGGAGCTGAGGTCTACTATCTCTGGACCAGTTTTGAAAATATGTTGCGTGAGATTGCACGATTGGATCAG 
GCTGAACTGGAAAGTATTGTGGTCTGGAATCGCCTCTTGGTCTATGCGACCTTATTTGGCTATGCGGACAAGGTTA 
GTCATTTGATGAAGGTTCATCAGATTCAAGTGGAAAATCCAGATATCAATCTCTATGTAGCTTATGGCTGGCACAG 
T ACGTTTT ATC ATTC A A C AG C AC AA ATG AG CC ATT ATG CT AG TGTCG C AAAT AC AG C A AG C ACCT ACTCTGT ATCT 

50 TCTGGAAGTGGAAGTTCTGGTGGTGGCTTCTCTGGAGGCGGAGGTGGCGGCAGTATCGGTGCCrTTTAA 

4191.2 

ATGAAAAAAGTAAGAAAGATATTTCAGAAGGCAGTTGCAGGACTGTGCTGTATATCTCAGTTGACAGCi'rrrTCTT 
CG AT AGTTG CTTT AGCAG AAACG CCTG A AACC AGTCC AG CG AT AG G A A A A GTAGTG ATT AAG G AG AC AGGCG A AG 

55 G AG G AG CGCTTCT AGG AG ATGCCGTCTTTG AGTTG A A A AAC A AT A CG G ATGGC AC AACTGTTTCGC AAAGG AC AG 

AGGCGCAAACAGGAGAAGCGATATTTTCAAACATAAAACCTGGGACATACACCTTGACAGAAGCCCAACCTCCAG 
TTGGTTATAAACCCTCTACTAAACAATGGACTGTTGAAGTTGAGAAGAATGGTCGGACGACTGTCCAAGGTGAAC 
AGGTAGAAAATCGAGAAGAGGCTCTATCTGACCAGTATCCACAAACAGGGACITATCCAGATGTTCAAACACCTT 
ATCAGATTATTAAGGTAGATGGTTCGG AAAA AAACGGACAGCACAAGGCGTTGAATCCGAATCCAT ATG AACGTG 

60 TGATTCCAGAAGGTACACnTTCAAAGAGAATTTATCAAGTGAATAATTTGGATGATAACCAATATGGAATCGAATT 
GACGGTTAGTGGGAAAACAGTGTATGAACAAAAAGATAAGTCTGTGCCGCTGGATGTCGTTATCTTGCTCGATAA 
CTCAAATAGTATGAGTAACATTCGAAACAAGAATGCTCGACGTGCGGAAAGAGCTGGTGAGGCGACACGTTCTCT 
TATTGATAAAATTACATCTGATTCAGAAAATAGGGTAGCGCTTGTGACnTATGCTrCCACTATCnTGATGGG 
G AGTTT AC AGTAG AAAA AGG G GT AGCAG AT A A AAACG G A AAGCG ATTG A ATG ATTCTCTTTTTTG G A ATT ATG AT 

65 CAGACGAGTTTTACAACCAATACCAAAGATTATAGTTATTTAAAGCTGACTAATGATAAGAATGACATTGTAGAAT 
TAAAAAATAAGGTACCTACCGAGGCAGAACACCATGATGGAAATAGATTGATGTACCAATTCGGTGCCACTTTTA
CTCAGAAAGCTTTGATGAAGGCAGATGAGATTTTGACACAACAAGCGAGA 

ATATTACGGATGGTGTCCCAACTATGTCGTATCCGATTAATTTTAATCATGCTACGTTTGCTCCATCATATCAAAAT 
CAACTAAATGCATTTTTTAGTAAATCTCCTAATAAAGATGGAATACTATTAAGTGATTTTATTACG 
5 GTGGAGAACATACAATTGTACGCGGAGATGGGCAAAGTTACCAGATGTTTACAGATAAGACAGTTTATGAAAAAG 
GTGCTCCTGCAGCTTTCCCAGTTAAACCTGAAAAATATTCT^ 

TCCAATTAATGGTGGATATATTTGGCTTAATTGGAGAGAGAGTATTCTGGCITATCCGTTT^ 

AAATTACCAATCATGGTGACCCTACAAGATGGTACTATAACGGGAATATTGCTCCTGATGGGTATGATGTCTTTAC 

GGTAGGTATTGGTATTAACGGAGATCCTGGTACGGATGAAGCAACGGCTACTAGTTTTATGCAAAGTATTTCTAGT 

10 AAACCTGAAAACTATACCAATGTTACTGACACGACAAAAATATTGGAACAGTTGAATCGTTATTTCCACACCATC 
GTAACTGAAAAGAAATCAATTGAGAATGGTACGATTACAGATCCGATGGGTGAGTTAATTGATTTGCAATTGGGC 
ACAGATGGAAGATTTGATCCAGCAGATTACACTTTAACTGCAAACGATGGTAGTCGCTTGGAGAATGGACAAGCT 
GTAGGTGGTCCACAAAATGATGGTGGTTTGTTAAAAAATGCAAAAGTGCTCTATGATACGACTGAGAAAAGGATT 
CGTGTAACAGGTCTGTACCTTGGAACGGATGAAAAAGTTACGTTGACCTACAATGTTCGTTTGAAT^ 

15 TAAGCAATAAATTTTATGATACCAATGGTCGAACAACCTTACATCCTAAGGAAGTAGAACAGAACACAGTGCGCG 
ACTTCCCGATTCCTAAGATTCGTGATGTGCGGAAGTATCCAGAAATCACAATTTCAAAAGAGAAAAA^ 
ACATTGAGTTTATTAAGGTCAATAAAAATGATAAAAAACCACTGAGAGGTGCGGTCTTTAGTCTTCAAAAACAAC 
ATCCGGATTATCCAGATATTTATGGAGCTATTGATCAAAATGGCACTTATCAAAATGTGAGAACAGGTGAAGATG 
GT AAGTTG ACCITT A AA A ATCTGTC AG ATGGG AAAT ATCG ATT ATTTG AAAATTCTG AACC AGCT 

20 CGTTCAAAATAAGCCTATCGTTGCCITCCAAATAGTAAATGGAGAAGTCAGAGATGTGACTrCAATCGTTCCAO 
GATATACCAGCGGGTrACGAGTTTACGAATGATAAGCACTATATrACCAATGAACCTATTCCTCCAAAGAGAGAA 
TATCCTCGAACTGGTGGTATCGGAATGTTGCCATTCTATCTGATAGGTTGCATGATGATGGGAGGAGTTCTATTAT 
ACACACGGAAACATCCGTAA 


4191.3 

ATGAAATCAATCAACAAATTTTTAACAATGCTTGCTGCCTTATTACTGACAGCGAGTAGCCT^ 
CAGTrmGCGGCTGGGACGACAACAACATCTGTTACCGTTCATAAACTATTGGCAACAGATGGGGATATGGATA 

30 AAATTGCAAATGAGrTAGAAACAGGTAACTATGCTGGTAATAAAGTGGGTGTTCTACCTGCAAATGCAAAAGAAA 
TTGCCGGTGTTATGTTCGTTTGGACAAATACTAATAATGAAATTATTGATGAAAATGGCCAAACTCTAGGAGTGAA 
TATTGATCCACAAACATTTAAACrCTCAGGGGCAATGCCGGCAACTGCAATGAAAAAATTAACAGAAGCTGAAG 
AGCT AAATTT AAC ACGG C AAATTT ACCAG CTG CT AAGT AT A AAATTTATG AAATTC ACAGTTT ATCAA CTT ATGTC 
GGTGAAGATGGAGCAACCirAACAGGTTCTAAAGCAGTTCCAATTGAAATrGAATTACCATTGAACGATGTTGTG 

35 GATGCGCATGTGTATCCAAAAAATACAGAAGCAAAGCCAAAAATTGATAAAGATTTCAAAGGTAAAGCAAATCCA 
GATACACCACGTGTAGATAAAGATACACCTGTGAACCACCAAGTTGGAGATGTTGTAGAGTACGAAATTGTTACA 
AAAATTCCAGCACTTGCTAATTATGCA^CAGCAAACTGGAGCGATAGAATGACTGAAGGTTTGGCATTCAACA^ 
GGTACAGTGAAAGTAACTGTTGATGATGTTGCACTTGAAGCAGGTGATTATGCTCTAACAGAAGTAGCAACTGGTT 
TTGATTTGAAATTAACAGATGCTGGTTTAGCTAAAGTGAATGACCAAAACGCTGAAAAAACTGTGAAAATCAC^ 

40 ATTCGGCAACATTGAATGACAAAGCAATTGTAGAAGTACCAGAATCTAATGATGTAACATTTAACTATGGTAATA 
ATCCAGATCACGGGAATACTCCAAAGCCGAATAAGCCAAATGAAAACGGCGATTTGACATTGACCAAGACATGGG 
TTGATGCTACAGGTGCACCAATTCCGGCTGGAGCTGAAGCAACGTTCGATTTGGTTAATGCTCAGACTGGTAAAGT 
TGTACAAACTGTAACTTrGACAACAGACAAAAATACAGTTACTGTTAACGGATTGGATAAAAATACAGAATATAA 
ATTCG1TGAACGTAGTATAAAAGGGTATTCAGCAGATTATCAAGAAATCACTACAGCTGGAGAAATTGCTGTCAA 

GAACTGGAAAGACGAAAATCCAAAACCACTTGATCCAACAGAGCCAAAAGTTGTTACATATGGTAAAAAGTTTGT
CAAAGTTAATGATAAAGATAATCGTTTAGCTGGGGCAGAATTTGTAATTGCAAATGCTGATAATGCTGGTCAATAT 
TTAGCACGTAAAGCAGATAAAGTGAGTCAAGAAGAGAAGCAGTTGGTTGTTACAACAAAGGATGCTTTAGATAGA
GCAGTTGCTGCTTATAACGCTCTTACTGCACAACAACAAACTCAGCAAGAAAAAGAGAAAGTTGACAAAGCTCAA
GCTGCTTATAATGCTGCTGTGATrGCTGCCAACAATGCATTTGAATGGGTGGCAGATAAGGACAATGAAAATGTTG 

TGAAATTAGTTTCTGATGCACAAGGTCGCTTTGAAATTACAGGCCITCTTGCAGGTACATATTACTTAGAAGAAAC

AAAACAGCCTGCTGGTTATGCATTACTAACTAGCCGTCAGAAATTTGAAGTCACTGCAACTTCTTATTCAGCGACT 
GGACAAGGCATTGAGTATACTGCTGGTTCAGGTAAAGATGACGCTACAAAAGTAGTCAACAAAAAAATCACTATC
CCACAAACGGGTGGTATTGGTACAATTATCTTTGCTGTAGCGGGGGCTGCGATTATGGGTATTGCAGTGTACGCAT 
ATGTTAAAAACAACAAAGATGAGGATCAACTTGCTTAA 


4191.4 

ATGACAATGCAGAAAATGCAGAAAATGATTAGTCGTATCTTCITTGTTATGGCTCTGTGTTriTCT 
TGCACATGCAGTCCAAGCGCAAGAAGATCACACGTTGGTCTTGCAATTGGAGAACTATCAGGAGGTGGTTAGTCA 
ATTGCCATCTCGTGATGGTCATCGGTTGCAAGTATGGAAGTTGGATGATTCGTATTCCTATGATGATCGGGTGCAA 
ATTGTAAGAGACTTGCATTCGTGGGATGAGAATAAACTTTCTTCT^

TTGAGAATCAGATTGAAGTATCTCATATTCCAAATGGTCTTTACTATGTTCGCTCrATTATCCAGACGGATGCGGT 
TTCTTATCCAGCTGAATTTCTTTTTGAAATGACAGATCAAACGGTAGA 

GATACAATGACAACAAAGGTGAAGCTGATAAAGGTGGATCAAGACCACAATCGCTTGGAGGGTGTCGGCTTTAAA 
TTGGTATCAGTAGCAAGAGATGTTTCTGAAAAAGAGGTTCCCn"GATTGGAGAATACCGTTACAGTTCTTCTG 
AAGTAGGGAGAACTCTCTATACTGATAAAAATGGAGAGATTTTTGTGACAAATCTTCCTCTTGGGAACTATCGTTT
CAAGGAGGTGGAGCCACTGGCAGGCTATGCrGTTACGACGCTGGATACGGATGTCCAGCTGGTAGATCATCAGCT

GGTGACGATTACGGTTGTCAATCAGAAATTACCACGTGGCAATGTTGACTTTATGAAGGTGGATGGTCGGACCAA 

TACCTCTCTTCAAGGGGCAATGTTCAAAGTCATGAAAGAAGAAAGCGGACACTATACTCCTGTTCTTCAAAATGGT 

AAGGAAGTAGTTGTAACATCAGGGAAAGATGGTCGTTTCCGAGTGGAAGGTCTAGAGTATGGGACATACTATTTA 

TGGGAGCTCCAAGCTCCAACTGGTTATGTTCAATTAACATCGCCTGTTTCCTTTACAATCGGGAAAGATACTCGT^ 

AGGAACTGGTAACAGTGGTTAAAAATAACAAGCGACCACGGATTGATGTGCCAGATACAGGGGAAGAAACCCTTG

TATATCTTGATGCTTGTTGCCATTTTGTTGTTTGGTAG 

4191.5 

ATGAGCCACATATACXTATCTATTTTCACAAGTCTCTTGCTGATGCT^ 

AATATTTACGCATCGGTATGGAAGCAGCATATGCTCCCTTTAACTGGACCCAGGATGATGATAGCAACGGAGCTG 

TCAAAATCGATGGGACCAATCAGTATGCCAACGGATACGATGTTCAAATCGCCAAGAAAATCGCTAAGGACTTAG 

GTAAAGAACCTTTGGTTGTTAAAACCAAGTGGGAAGGTCTAGTCCCTGCCCTTACTTCTGG 

TATCGCAGGTATGAGTCCAACTGCAGAACGCAAACAAGAAATTGCCTTTTCGAGCAGTTACTATACTAGCGAACC 

AGTTTTGCTTGTCAAAAAAGATTCTGCCTACGCAAGTGCTAAA 

TCTCAACAAGGGGTCTACCTTTATAACTTGATTGCACAAATCCCAGGTGCTAAAAAAGAAACAGCCATGGGAGAC

TTCGCTCAAATGCGACAAGCrCTTGAGGCTGGTGTCATTGATGCTTATGTTTCTGAACGTCCAGAA^ 

CTGAAGCTGCGAACTCTAAGTTCAAGATGATTCAAGTAGAACCTGGTTTCAAAACTGGGGAAGAAGATACAGCT 

TCGCTATCGGGCTTCGTAAAAATGACAATCGTATTAGCCAAATCAATGCCAGCATTGAAACCATTTCAAAAGATG

ACCAAGTTGCCTTGATGGATCGTATGATCAAGGAACAACCTGCCGAAGCTACAACAACTGAAGAGACTAGCAGTA
GTTTCTTTAGCCAAGTTGCTAJVAATTCTTTCT 

ATCTCTATCGTCGGAACCATCATAGGTCTCATTATTGGACITGCCATTGGTGTCITCCGTACrGCTCCT^ 

AAACAAAGTCATTTACGGCCrACAAAAACTAGTCGGCTGGGTTCrCAATGTCTACATTGAAATTTTCCGTC 

CCAATGATTGTTCAATCGATGGTTATCTACTATGGAACTGCCCAAGCTTTCGGGATCAACC 

TTGACCGTACACTGGCTGCTATCTTCATCGTTTCAATCAATACCGGTGCCTACATGACTGAAATCGTCCGTGGTGG 

T ATCCT AG C AGTTG AC A AGGG AC AATTTG AAG CTG CG ACTGCTCTTGGT ATG ACCC AT AACCAG ACC ATG CGT AA 

GATTGTCCTACCTCAGGTAGTCCGTAACATCCTACCTGCAACTGGTAATGAATTTGTCATCAATATCAAAGATACA 

TCTGTATTGAACGTTATCTCTGTTGTCGAACrTTATTTCTCAGGAAATACCGTGGCAACAC 

CCAGACATTTACAATCATCGCCGTGATTTACITTGTCCTCACCTTCACCGTAACACGT ATCCT ACGCTTTATCGAGC 
GCAGAATGGACATGGATACCTACACTACAGGTGCTAACCAAATGCAAACGGAGGATTTGAAATAA 

4191.6 

ATGACACAAGCAATCCTrGAAATTAAACACCTCAAAAAATCCTATGGACAAAACGAAGTGCrAAAAGACATTTCA 
CTCACTGTCCACAAGGGAGAGGTCATCTCTATCATCGGAAGCTCTGGAAGCGGAAAATCGACCTTCCTACGCTCC 
ATT AACCT ACTTG AAAC ACC AACTG ATGG AC A A ATCCTTT ATC ATG G AC AA AACGTCCTCG AAAAAGG CT ATG A C 
CTCACGCAATACCGTGAAAAGTTGGGGATGGTTTTCCAATCCmAAOT 

ACACAATCGTCGCTCAGACAACTGTCCTAAAACGCGAACGCACAGAAGCTGAAAAGATTGCCAAAGAAAACCTG 

GAAAAGGTCGG C ATGGG AG AACG CTACTGGC AAG CC AAAC CAAAACAACTCTCAGGTGGTCAAAA AC AACGTGT 

GGCCATCGCTCGTGCCCTCTCCATGAATCCGGACGCTATTCTCTTTGATGAACCAACATCAGCTCTCGATCCAGAA 

ATGGTTGGAGAAGTCCTCAAAATCATGCAGGACCTGGCTCAGGAAGGCTTGACCATGATTGTCGTAACCCATGAA 

ATGGAATTTGCCCGTGATGTCTCTCACCGTGTTATCTTTATGGATAAGGGCGTGATCGCTGAAGAAGGTAAACCAG 

AAGACCTCnTCACCAATCCTAAAGAAGACCGAACAAAAGAGTTCCTTCAACGCTATCTCAAATAA 

4192.3 

ATGAAAAAGTATCAACTTCTATTCAAAATAAGTGCAGTCTTCTCTTACTTA r 

GACGCTTATCGTCCAAAACTATTGGCAATTTTCTTCTCAGATAGGCAATTTATTCTGGAT^ 

TACTTTTTATTGGAGTCATGATTGTGGTTCnTGTTAAGACAGGCCATGGTTAT^ 

ATG G CTTTG GT ATTCG ATTTTG AC AGT ATT AGTG CT A GTGTTCC AG ATCTCTTTT A ACGTTC AG AC AG CT A AA CATG 
TTCAGTCAACTGCGGAAGGTTGGGCTGTATTGATTGGTTATAGTGGGACTAACTTTGCAGAGCTAGGTATTTATAT 
AGC CCTGTTCTTTCTG GTTCC ACTG ATGG A AG A ATTG ATTT AT AG AG G ATT ACTG C AAC ATG CI'lTCl T 1' AAG C ATT 
CGCGATTTGGTCTTGATTTGcn 'Cl'l CCTTCTATTTTATTTGCTCTCCCTCATTTTTCAAGCCTGCCTAGTCTGTTA^ 
ATATCnTCGTCTTTGCAACAGTTGGAATCATCrrTGCTGGTTTGACCCGCTATACCAAGAGCAm 
GCGGTGCATGTGATCAATAATATTGTAGCGACCrrCCCGriMUnGCTCACrinM'CTACATAGGGTCTTGGGGT^ 

4193.1 

ATG AAC A AG AA AC AATGG CT AGGTCTTG GCCT AGTTG C AGTG GC AG C AGTTG G ACTTGCTG C ATGTG GT AACCG C 

TCTTCrCGTAACGCAGCTTCATCTTCTGATGTGAAGACAAAAGCAGCAATCGTCACTGATACTGGTGGTGTTGATG 

AC A AATC ATTC A A CC A ATC AG CTTG GG AAG GTTTG C AGGCTTGGG GT AA AG AAC AC AATCTTTC A AAAG AT AA CG 

GTTTCACrrACTTCCAATCAACAAGTGAAGCTGACTACGCrAACAACTTGCAACAAGCGG 

AATCTTCGGTGTTGGTTTTGCCCTTAATAATGCAGTTAAAGATGCAGCAAAAGAACACACT^ 

TTG ATTG ATG ATGTG ATT AAAG A CCAA AAG AATGTTG CG AGCGT AACTTTCG CTG AT AATG AGTCAGGTT A CcTTG 

CAGGTGTGGCTGCAGCAAAAACAACTAAGACAAAACAAGTTGGTTTTGTAGGTGGTATCGAATCTGAAGTTATCT 

CTCGTTTTGAAGCAGGATTCAAGGCTGGTGTTGCGTCAGTAGACCCATCTATCAAAGTCCAAGTTGACTACGCTGG 

TTCATTTGGTGATGCGGCTAAAGGTAAAACAATTGCAGCCGCACAATACGCAGCCGGTGCAGATATTGTTTACCA 

AGTAGCTGGTGGTACAGGTGCAGGTGTCTTTGCAGAGGCAAAATCTCTCAACGAAAGCCGTCCTGAAAATGAAAA 






AGTTTGGGTTATCGGTGTTGATCGTGACCAAGAAGCAGAAGGTAAATACACTTCTAAAGATGGCAAAGAATCAAA 
CTTTGTTCTTGTATCTACTTTGAAACAAGTTGGTACAACTGTAAAAGATATTT 

TTCCCTGGCGGTCAAGTGATCGTTTAO'CATTGAAGGATAAAGGGGTTGACTTGGCAGTAACAAACCm 
AAGGTAAAAAAGCTGTCGAAGATGCAAAAGCTAAAATCCTTGATGGAAGCGTAAAAGT 
TCCTGAAAAATAA

4193.3 

ATGTCTAAAAAATTACAACAAATTTCGGTTCCCTTGATTTCTGTATTCCTAGGAATTT^ 

GTGGATCTTCGGTTATGATGCTATTTGGGGCTACGAAGAATTGTTCTATACAGCCTTTGGCAGTCTGCGTGGGAT^ 
GGAGAAATCTTCCGTGCTATGGGTCCTCTGGTCTTGATTGGTCTTGGTTTTGCCGTTGCC^

TAACGTCGGACTTCCTGGTCAGGCTTTGGCAGGTTGGATTCTCAGTGGTTGGTTTGCCCTGTCGCATCCAGATATG 
CCCCGTCCCTTGATGATTCTAGCAACCATCGTGATTGCCTTGATTGCTGGTGGGATTGTCGGAGCGATTCCAGGTA 
TTCTTAGGGCCTATCTAGGGACGTCAGAGGTTATTGTAACCATCATGATGAACTACATTGTCTTGTATGTAGGGAA 
TGCCTTTATCCATGCTTrCCCTAAAGACTTCATGCAAAGTACAGATTCGACCATTCGTGTTGGGGCTAATC 
TATCAGACACCTTGGTTGGCTGAGTTGACTGGTAACTCACGGATC

TGCAGTTATTTGGTTCATGCTCAAGAAAACAACTCTTGGTTTTGAAATCCGTGCAGTTGGTCTT 
CAGAATATGCTGGTATTTCTGCCAAGCGGACTATTATCCTATCTATGATTATTTCAGGTGCCTTGGCAGGTCTTGGT 
GGAGCTGTTGAAGGTTTGGGAACCTTCCAGAACGTCTATGTTCAAGGTTCGTCATTAGCTATCGGATTT 
TGGCGGTTAGTTTGCTTGCGGCCAACTCACCAATTGGTATACTCTTTGCAGCCTTCCTATTTGGCGTTCT 
GGGGCTCCTGGTATGAATGCGGCGCAGGTACCATCTGAGCTTGTCAGCA

TCAGTGTTCATTACCTTATCGAACGCTTTGTCAAACCGAAAAAACAAGTTAAAGGAGGTAAGTAA

4194.1 

ATGGG AGTG AAAAAG A AACT AAAGTTG ACT AGTTTGCT AG G ACTGTCTCTGTT A ATC ATG A C AG CC TGTGCG ACT 
AATGGGGTAACTAGCGATATTACAGCCGAATCGGCTGATTTTTGGAGTAAATO
TTCGCTTTTTATCGTTTGATATTAGTATCGGAGTGGGGATTATTCTCT^ 

CCAGTCTTTCAGGTGCAAATGGTGGCTTCrAGGAAAATGCAGGAAGCrCAGCCACGCATTAAGGCGCTTCGAGAA 
CAATATCCAGGTCGAGATATGGAAAGCAGAACCAAACTAGAGCAGGAAATGCGTAAAGTATTTAAAGAAATGGG 
TGTCAGACAGTCAGACTCTCirrGGCCGATTTTGATTCAGATGCCGGTTATTTTGGCCCTGTTCCAA 
G AGTTG A C " I ' t ' 1 ' 1 ' I AAAG AC AGGTC ATTTCTT ATGG ATT AACCTTGGT AGTGTGG ATACAACCCTTGTTCTTCCG ATT

TTAGCAGCAGTATTCACCTTTTTAAGTACTTGGTTGTCCAACAAAGCTTTGTCTGAGCGAAATGGCGCT 
CGATGATGTATGGGATTCCAGTCTTGATTTTTATCTTTGCAGTTTATGCGCCAGGTGGAGTCGCCCTATACTGGAC 
AGTGTCTAATGCTTATCAAGTCTTGCAAACCTATTTCITG 

GTAGTACAGGCACAAAAAGATTTGGAAAATAGAAAAAGAAAAGCCAAGAAAAAGGCTCAGAAAACGAAATAA 


4194.4 

ATGGTTATCGATCCATTrGCTATCAACGAACTAGACTATTACTTAGTTTCACACrrCCAC 

CATACACAGCTGCAGCAATTCTCAATAATCCTAAGTTAGAGCATGTTAAGTTTATCGGTCCTTACCACTGTGGACG 
AATCTGGGAAGGATGGGGTGTTCCAAAAGAACGTATCATCGTTGTTAAACCAGGTGACACTATCGAATTAAAAGA 
TATGAAGATTCATGCAGTAGAATCATTTGACCGTACTTGCrrGGTAACTCTCCCAGTGAACGGTGCTGATGAGACA
GGCGGTGAACTTGCTGGCITGGCTGTTACAGATGAAGAAATGGCTCAAAAGG 
CAGGTGGAACCATCTATCATGGTGCAGATTCTCACTTCTCAAACT^ 

TGATGTTGCTITGAATAACTATGGTGAAAATCCGGTAGGTATCCAAGACAAAATG ACATCTATCGACCll L 1 1 CGT 
ATGGCAGAAAATCTGCGTACCAAAGTCATTATCCCAGTTCACTATGATATCTGGTCTAA^ 
ATGAGATTCTAGAACTTTGGAAAATGCGAAAAGATCGCTTGCAATACGATTTCCA

CGGTAAGTACACTTATCCTCAAGATCAACACTTAGTAGAATACCATCATCCACGTGGTTTTGATGATTGTTTTG^ 
CAAGACTCTAACATTCAATTTAAAGCTTTGCTATAA 




4196.2 

ATGTTCCrrTCAGGCTGGTTGTCTAGTTTTGCTAATACTTATATCCATGATrTACT^ 
TCCATTITTAAATGCCTTTGAAAGTGCTATTGCGGCT^ 
TTGTTITGGCTTTGATTCCTGTGCGAAAATTAAAATCITTG'lUT

ATGATTAAGGATATTGGTTATATTCGTACGGATTTGCCAGAGGGCTTTGACTTTACT 

GTATCATCTCAGGAATTGCCTCTCACTGGACnTTTTCAGGTCTAGCTGTAGTA 

TATAAAGGACAGAAGGTTGGCAAGAAACAGGGCCITATTTITCTAGG 

ACTCTCCTTrrGTGGAGTTGGAAACAGAGTTGCCTTTAGCGATTCCAGTGGTTACGGCTATTGCTCT 
TATCATGCTTATTGCTTTGTTGAGAAACACAATGAGTTGATGACCTAG

4197.1 

ATGAAGGTGGAACCACGTTGCGACGTCCrTTCGAGGATGTCGCAriTrri^'ATTAGGATACTAATTATGGAGTTGC 
AAG AATT AGTGG A G CGC AGTTGG G C A ATCCG AC A AG CTT ATC AC G AACTG G A AGTT A AGC ATC ATG ATTCC AAGT 
GGACGGTAGAAGAAGACCTCTTGGCTTTATCTAATGATATTGGAAATTTCCAACGACTGGTGATGACAAAGCAAG
GACGCTACTATGATGAAACACCCTACACACTGGAACAAAAACTTTCAGAAAATATCTGGTGGCTATTAGAACTTT
CTCAACGTTTGGATATAGACATTCTGACGGAAATGGAAAACTTCCTCTCTGATAAAGAAAAGCAATTGAACGTTA 
GGACTTGGAAGTAG 

4197.4 

ATGCTTGATTGGAAACAATTTTTTCTAGCCTATCTGCGCTCC^ 
ATTTCTTGTCTTACTCTTTCAGTTTTTATTTGCCAGTCTAGGAAm 

GTAACCATATTAl'lUU'rCACTTGGGACATATTGGTGGAAACGCAGGTCTATCGCCAGGAACTTCTCTATGGAGAGA 

GGGAAGCCAAGTCTCCTTTGGAAATAGCTTTAGCAGAAAAATTAGAAGCGCGTGAGATGGAACTCTATCAG 

GGTCAAAAGCAGAAAGAAAACTGACGGATTTGCTGGATTACTATACCTTGTGGGTCCATCAGATAAAGACCCCCA 

TTGCAGCCAGTCAACTCITAGTTGCAGAAGTGGTCGACCGCCAACTGAAGCAGCAGCTAGAACAGGAAATTTTCA 

AAATCGACTCCTATACCAACCTAGTTTTACAGTACCTGCGTTTAGA 

GGTTCAAATTGAGGACTrGGTCAAGGAAATAATTCGTAAATATGCTCTTTTCm 

AATCTACATGACCTTGATAAAGAAATCGTGACGGATAAAAAGTGGCTGCTAGTGGTTATTGAGCAAATCATCTCA 
AACAGTCTCAAGTACACCAAGGAAGGTGGTCTGGAGATTTATATGGATGACCAAGAGCTTTGTATCAAAGATACG 
GGAATCGGGATAAAAAACAGTGATGTCCTCCGAGTATTTGAACGTGGCTTTTCAGGATACAATGGCCGTTTGACCC 
AGCAGTCCTCTGGACTTGGCCTTTATCTATCTAAGAAAATTTCTGAAGAACTGGGGCACCAGATTCGTATCGAGTC 
TGAGGTCGGAAAAGGAACGACAGTGCGGATTCAGTTTGCTCAAGTGAACTTAGTCCTTGAGTAA

4211.2 

ATGGAACTTAATACACACAATGCTGAAATCTTGCTCAGTGCAGCTAATAAGTCCCACTATCCGCAGGATGAACTG 
CCAGAGATTGCCCTAGCAGGGCGTTCAAATGTTGGTAAATCCAGCTTTATCAACACTATGTTGAACCGTAAGAATC 
TCGCCCGTACATCAGGAAAACCTGGTAAAACCCAGCTCCTGAACTTTTrrAACATTGATGACAAGATGCGCTTTGT 
G G ATGTG CCTGGTT ATGG CT ATG CTCGTGTTTCT AA AAAG G AACGTG AAAAGTG GGGGTG C ATG ATTG AGG AGT A 
CTTAACGACTCGGGAAAATCTCCGTGCGGTTGTCAGTCTAGTTGACCTTCGTCATGACCCGTCAGCAGATGATGTG 
C AG ATGT ACG AATTTCTC AAGT ATTATG AG ATTCC AGTC ATC ATTGTG G CG ACC AAG GCGG AC AAG ATTCCTCGTG 
GTAAATGGAACAAGCATGAATCAGCAATCAAAAAGAAATTAAACTTTGACC^^ 
ATCTGTCAGTAAGGCAGGGATGGATGAGGCTTGGGATGCAATCTTAGAAAAATTGTGA 

4211.3 

ATGACAAAGAAACAACTTCACTTGGTGATTGTGACAGGGATGAGTGGCGCAGGGAAAACTGTAGCCATTCAGTCC 
TTCGAGGATCTAGGTTATTTCACCATTGATAATATGCCGCCAGCTCTCTTGCCT^ 

TAAGGAAGACAATCCTAAGTTGGCCTTGGTAGTGGATATGCGTAGCCGTTCIUnClUUUCAGAGATTCAAGCTGTT 

TTGGATGAGTTGGAAAATCAAGATGGTTTGGATTTCAAAATCCTCIT r T r GGATGCGGCrGATAAGGAATTGGTCG 

CTCGTTACAAGGAAACCAGACGGAGTCACCCACTAGCAGCAGACGGTCGTATTTTAGATGGAATCAAGTTGGAAC 

GTGAACTCTTGGCACCTTTGAAAAATATGAGCCAAAATGTGGTGGATACGACTGAACTCACTCCACGTGAGcTGCG 

CAAAACCCXTGCAGAGCAGTTTTCAGACCAAGAACAAGCCCAGTCmCCGTATCGAAGTCATGTCTT^ 

AAGT ATGGAATCCCGATTGATGCGGACTTGGTCTrrGATGTCCGTTTCnTGCCAAATCCCr ATT ATT TACCA GAACT 

GAGAAACCAAA(XiGGTGTGGATGAACCTGTTTATGATTATGTCATGAACCATCCTGAGTCAGAAGA(nTrTATCAA 

CATTrATTGGCCTrGATTGAGCCGATTCTGCCAAGTTACCAAAAGGAAGGTAAGTCCGTTITGACCATTGCCATGG 

GATGTACGGGTGGACAACACCGTAGTGTGGCATTTGCTAAACGCrTGGCGCAGGACTTATCCAAGAATTGGTCTGT 

TAATGAAGGGCATCGCGACAAAGACCGCAGAAAGGAAACGGTAAACCGTTCATGA

4211.4 

ATGAGAAAACCAAAGATAACGGTGATTGGTGGAGGGACTGGAAGTCCCGTCATTCTAAAAAGTCTGCGGGAAAAA 

GATGTGGAAATCGCAGCTATCGTGACGGTGGCAGATGATGGTGGTTCTTCAGGTGAACTCCGAAAAAATATGCAA 

CAGTTGACACCGCCAGGTGATCTTCGTAATGTCCITGTGGCCATGTCGGATATGCCTAAGTTrTATGAGAAGGTCT 

TTCAGTATCGGTTCrCTGAGGATGCCGGAGCCTTrGCTGGCCATCCArrGGGAAATCTCATCATTGCT 

AGAAATGCAGGGTTCAACCTATAATGCCATGCAGTTATTGAGCAAATTTTrCCATACAACAGGGAAAATTTATCCT 

TCCAGTGACCATCCITTGACCCTrCATGCAGTCnTrCAGGATGGGACAGAAGTGGCTGGAGAGAGTCATATTGTAG 

ACCATCGAGGCATAATTGACAATGTCTATGTGACCAATGCCCTAAACGATGATACGCCTCTGGCCAGCCGTCGAG 

TAGTGCAGACCATCCTTGAAAGTGACATGATTGTCCTAGGGCCAGGTTCCCTCTTTACCT 

CGTGATTAAGGAAATTGGGCGGGCTCTTTTGGAAACCAAGGCAGAAATTGCCrATGTCTGCAATATCA TGACC CA 
ACGTGGGGAGACGGAACACTTTACAGATAGCGACCACGTGGAAGTCTTGCATCGTCACCTTGGTCGCCCTTTTATC 
G AC ACTGTCTTG GTG A ATATTG AAAAAGTG CCTC AG G AAT AC ATG AATTCC A ACCGTTTTG ATG AAT ACTT AGTG C 
AAGTG G AAC ACG ATTTTGTAG GTCTTTGT AAG C AAGTTTCGCG CGTG ATTTC ATCT AACTTCCTTCGTCTGG AAAA 
TGGCGGTGCCTTCCACGATGGAGATTTGATTGTGGACGAGTTGATGCGCATTATACAGGTGAAAAAATGA 

4213.1 

ATGAAAAATTTGATAAAGTTGCTAATAATTAGATTGATTGTTAACTTAGCAG 

TGTG G C ACGTTAGC AAT AATT ATTCTTCG AG C ATGTTCTT AG G AAT ATTT ATTG C AGT AA ATT ATCT ACCGG ATTTG 
TT ACT AATC1" l"l" IT l'GG ACC AGTT ATTG AC AG AGT A AATCCG C AAAAA ATTCTT AT AAT ATC AATTTTG GTTC A ATT 
AGCAGTGGCTGTAATA i'i'X'Y l" ATTATTATTAAACCAAATATCATTTTGGGTG AT AATG AGTCTAGTGTTT ATTTC AG 
TAATGGCTAGCTCCATAAGTTACGTGATAGAAGATGTGTTGATTCCTCAAGTGGTAGAATATGATAAGATTGTATT 
TGCAAATTCTCTTTTTAGTATTTCGTATAAAGTATTAGATTCTATTTTT
CAGTAGGATTTATTTTATTGGTTAAGATAGATATAGGCATATTTTTACTTGCT

TTT AG A ACT AG C A ATG CG AAT AT AG A A AACn'CTCTTT C AAAT ATT A C AAG AG AG AAGTGTTG C AAGGT AC AAAG 

TTTATTTTAAATAATAAATTATTATTTAAAACCAGTATTTCTT^ 

GTAGTTGTACCGAlTnUTCTATTCGATATTTTGATGGTCC^ 

GGTGGTATATTGGGAAATATGCTAGCGCCAATCGTAATAAAATATTTAAAATCGAATCAAATTGTT GGTGTA TTTC 
TTTTTTTGAACGGCTCAAGTTGGTTAGTAGCAATTGTTATAAAAGACTATACITT 
GTTTTATGTCTAAAGGAGTCTTCAATATTATTTTTAATTCGTTGTACCAACAAATACCT 
AGGGTAAATACTACCATTGATTCTATTATTTCITrTC 

TTTG A AT ATTG AATTAGTGTT AATTG CT ATT AG C AT ACCTT ATTTTTTG' l" 1" 1" rCTTATATTTTTT AT A CGG AT A ATG G 
ATTGAAAGAATTTAGTATATATTAG 

4213.2 

ATGATGTCTAACAAAAATAAGGAAATTCTGATTTTTGCGATTCT 

ATTGCTGGCTTCTTTAATGCCATCTGCCATTGCAAATTATCTTGTTTATGTAGTTTTAGCT 

GTTCAAGGATAGATTGATCCAACAATGGAAGGAGATTAGAAAGACTAAAAGAAAATTCTTCnTTGGA 

AGGATGGCTCTTTCTCATTCTGATGACrGTTGTCTTTGAATTTGTATCAGAGATGTO 

ATGGACAAGGTCTAAATCAGTCTAATATTCAAAGTACCTTTCAAGAACAACCACTACTGATAGCTGI'iMMMGCTTG 
TGTCATTGGACCTCTGGTAGAAGAATTATTTTrCCGTCAGGTCrrATTGCATTACTT 

TACT AAGCATT ATT CTGGTAGGACTTG III' II GCTCTG ACTCATATGCACAGTTTGGCTCTATCAG AGTGG ATTGGT 

GCAGTTGGTTACTTAGGTGGAGGCCTTGCCTTTTCTATTATTTATGTGAAAGAAAAAGAG 

TACTTGTTCACATGTTAAGCAACAGCCTCTCCnTAATCATTTTAGCTATCAGTATAGTAAAATGA 



4224.1 

TTGAAAAAGCCAATTATCGAATTCAAAAACGTCTCTAAAGTTTrTGAAGACAGCAACACCAAGGTrcrCAAAGAC 
ATCAACTTTGAGTTGGAAGAAGGGAAATTCTACACCCTTCTAGGTC^ 

ACATTATTGCAGGTTTACTGGATGCGACGACAGGAGATATCATGCTAGACGGTGTTCGTATCAATGATATTCCAAC 

CAACAAGCGCGACGTACATACCGTCTTCCAATCCTATGCCTTGTTCCCACATATGAATGTGTTTGAAAATGTTGCC 

TTTCCACITCGCn'GCGTAAAATTGATAAGAAAGAAATCGAGCAGCGTGTAGCGGAAGTTCTCAAGATGGTTCAGT 

TGGAAGGTTATGAAAAACGTTCCATCCGCAAACTTTCTGGAGGACAACGTCAGCGTGTGGCCATCGCCCGTGCTA 

TCATCAACCAACCCCGTGTGGTCTTGTTGGACGAGCCTTTATCAGCGCTGGACTTGAAATTGAGAACAGACATGCA 

GTACGAATTGCGTGAATTACAACAACGATTGGGCATTACCITTGTCnTrGTC 

GCCATGAGTGACTGGATTTTCGTTATGAATGATGGCGAGATTGTCCAGTCTGGAACCCCTGTGGACATCTACGATG 

AGCCAATCAACCACTrrGTTGCCACCrTTATCGGGGAGTCAAACATCTTGCCAGGTACCATGATTGAGGACTAOT 

GGTCGAATTTAACGGCAAACGCTTTGAAGCGGTTGATGGTGGGATGAAGCCAAATGAACCTGTTGAG GTCGT TAT 

TCGTCCAGAGGACrTGCGCATTACCCTTCCTGAAGAAGGCAAGCTCCAAGTTAAGGTCGATACCCAGCrriTCCGT 

GGAGTTCATTATGAAATTATCGCCTATGACGAACTTGGAAATGAATGGATGATCCACTCAACCCGTAAGGCTATCG 

TGGGTGAGGAAATCGGTCTGGACTTTGAACCAGAAGACATCCACATCATGCGTCTCAATGAAACCGAAGAAGAGT 

TCGATGCTCGTATTGAGGAGTACGTAGAAATCGAAGAGCAAGAAGCAGGTTTGATCAATGCAATCGAGGAGGAAA 

GAGATGAAGAAAACAAGCTCTAA

4252.1 

ATGAAATCAATGAGAATCTTATTTTTGTTAGCITrAATT 
GCATCTTGTCCnTTAAACAAAGTACAGCTTTTTTCATCGGAAGCATGGTm 

AAATTATCITrATACCCGTAAGCAAGAAGTCCATAGTGTCCTAGCCAGTAAGAAGTCGGTGAAGCrm 

ATGTTACTCTTAATTAATTTGTTAGGAGCTGTTCTTGTTTTGTCAGATAACTTG 

AGAATTAGTTGA CTnTl ^\TTGCCATCClTCnTlTCCTATTTGGGCT 

ATACGTGCGCGA' ITn ' lTro CTATGCTGGACAGAAAAAAGACAGTGTO 

TTAAGAAATCCAATGACCATTGTCTCACITCTGATTTATATTGGACrGGGCTTGTTT^^ 

A AATTC GGTTAAGAAGGAAGTTTCCTTTTATGGTCATATTTTCCGAGATCTTGTATO 

TCTTTTAG 

4252.2 

ATGGTTAAAAAAATTATTGGAATGGTGCTAGCTTTACTTTCT 
TATTTATCAACAAGGGACAGAAACCTTAGCTAAAACCTATAAAAAAAT^ 

GACrGAACCTCTAACCATTCTGTTAATGGGAGTGGACACCGGAAATGTTGAACGAACTGAAACTTGGGTCGGTAG 
AAGTGATAGCATGATCTTGATGACAGTGAATCCTAAAACGAAAAAAACAACAATGATGAGTTTAGAGCGGGATAT 
TCTGACGCGCATTGAATCAGGGAATGGTCAGGCTCATGAAGCGAAACTGAACTCAGCATATGCAGATGGTGGAGC 
AG AG CTTG CT AT AG AAACC ATTC AA A AA ATG ATG AAT ATCC AT ATTG ATCG CT ATGTG ATGGTC AAT ATG AG AGG 
ATTGCAAAAACTAGTGGATGCAGTAGGAGGTATTACAGTCAATAATATCCTAGGTTTCCCAATTTCTATCAGTGAC 
C AAG A AG AATTT AAT A CT ATTTCT ATCG GTGTTGGG G AG C A AC AT ATTG G GG G AG AAG AAG CCCT AGTCT ATGC A 
CGAATGCGTTACCAAGATCCTGAGGGGGATTATGGTCGTCAAAAACGTCAACGTGAAGTTATTCAAAAAGTCATG 
GAAAAAGCTCTCAGTTTAAATAGCATTGGTCATTATCAAGAGATTCTAAAAGCTTTGAGTGACAATATGCAGACC
AATATTGATTTGTCTGCAAAAAGTATCCCTAACTTGCTAGGCT

AGTTGCAGGGTGAAGGAGAGATACTTCAAGGTGTTTCTTACCAGATTGTTTCGAGAGCA CATA TGTTGGAAATGCA 
AAATCT ACTCCG ACGTTCTTTGGG AC AAG AAG AAGTT ACTC AG CTTG A A ACC AATG CGGTTTT ATTTG AAG ATTT A 
TTTGGCAGAGCACCTGTTGGTGATGAAGATAATTAA 


4256.2 

ATGAAAAAACAAGCCTATGTCATTATTGCTCTCACCTCCTTCCT 
AATACTTGATTTTGACTGGTCTATCnTTTTGCACGATGTCGAAAAAACAGAAAAA 
TCAGCATGTCCATGACCTGTCTCTTAGCCCTGTTTTGGCGAGGGATCGAAGAGCTTTCTCT 
TAATCTCAAGCGTTTATTAGCAGGGCAAGAAGTGGTTCAGGTTGCAGATCCAGATTTGGATGCCAGTTTCAAGTCC
TTATCAGGTAAACTTAACCITITGACAGAGGCrCTTCAAAAAGCTGAAAATCAGAGC 

ATCATCGAGAAGGAACGGAAGCGAATTGCTCGGGATTTGCACGATACAGTCAGTCAGGAGTTGTTTGCGGCCCAC 
ATGATTTTATCGGGTATCAGTCAGCAGGCTTTGAAATTGGATAGAGAAAAGATGCAGACCCAGTTGCAGAGTGTC 
ACAGCTATTTTAGAAACAGCCCAGAAGGATTTGCGGGnTTGCTCTTGCATTTGCGAC 

AGAGCTTGATAGAAGGGATTCAAATTCTTTTAAAAGAGCTTGAGGACAAGAGTGATCTTAGGGTTAGTCTCAAGC

AGAATATGACGAAATTGCCTAAGAAAATCGAGGAGCATATCTTCCGTATCCTGCAAGAGTTGATTAGCAATACCC 
TCCGCCATGCCCAGGCATCTTGCCTAGATGTCTACCTCTATCAGACAGATGTTGAATTGCAACTGAAGGTGGTGGA 
CAATGGGATTGGTTTCCAGTTAGGGAGCTTAGACGACrTGAGTTATGGACrGCGAAATATCAAGGAGCGGGTTG^ 
AGATATGGCTGGAACAGTrCAACTCTTGACAGCTCCCAAGCAAGGGCrGGCGGTTGATATCCGTATrCCCCTGTT 

GATAAGGAATGA

4263.1 

ATGATTGTTTCCATTATTTCTCAAGGATTTGTCTGGGCTATTCTAGGTCT 

AAACTTTCC AG AT ATG ACG AC AG AAG GTTCCTTC CCTCTTG G G G G AG CTGTTG CTGTC ACTTTG AT AACCAAAG G C 
GTGAACCCATTTTTAGCGACACTTGTTGCTGTAGGAGCAG

CAAAAGGGAAGATCCCAACCTTGCTCTCAGGGATTTTGGTGATGACTTCTTGTC ACTC AATCATC 
GGGACGTGCGAATTTAGGCCTGCTTGGAACCAAGCAAATTCAGGATGTTTTGCCITTTGATTCGG 
CTCITGACAGGTCrCATCTTTGTGAGTATTGTTATTGCrCrCATG 

CTATATTGCTACAGGGGATAATCCTGATATGGCTAGAAGTTTCGGGATTCATACTGGACGCATGGAGCTCATGGGC 
TTGGTCTTATCAAATGGTGTGATTGCCCTTGCAGGTGCCCTCATTGCTCAGCAAGAAGGTTATGCCGATGTGTCTC
GAGGGATCGGCGTTATCGTTGTGGCGCTTGCAAGTTTGATTATTGGAGA^ 

AGAGCGTTTGGTTACTATCGTTGTAGGTTCTATCGCTTATCAATTTTTAGTGTGGGCAGTTATCGCACT 

ATACAAGTTACCTTCGTTTATACAGTGCCTTGATTTTAGC^^ 

CTTGAAAGGAGCCAAGTTAAGCAAATGA 


4346.1 

ATGAAAAAAATGAAAGTTTGGTCTACTGTACTTGCAACGGGAGTTGCTCTTACTACACTTGCT 
GTTCAAATTCrACGACTGCrrCTTCATCTGAAGAAAAAGCTGATAAAAGTCAA 

AGTCTCAAATGGTCGTGGTGATTGGTTAACTGCTAAAGCAAAAGAAGCTGGTTrTAATATAAAAATGGTTGATATC 
GCTGGCGCTCAATTAGCAGACCGTGTTATTGCTGAGAAGAATAATGCAGTTGCAGATATGGTATTTGGAATTGGTG
CTGTTGATTCAAATAAAATTAGAGATCAAAAATTACTAGTACAGTACAAGCCTAAATGGTTAGATAAAATTGATC 
AATCTTTATCAGATAAAGATAATTATTATAATCCTGTGATTGTTCAACCATTAGTm 
AAAGAAATGCCTAAAGATTGGACTGAATTAGGTAGTAAGTATAAAGGTAAATATTCAAriTCrc 
GGTACAGGACGGGCAATTCTAGCAAGTATCrTAGTTCGATACCITGATGATAAAGGTGAATTAGGTC 
AAGGTTGGGAAGTAGCAAAAGAATATTTGAAAAATGCATACACTCTTCAAAAGGGAGAAAGTTCAATTG

TGTT AG AC AAAG AAG ATC C AAT AC AAT ATGG AATG ATGTGG GGTTCTGGTG C ATT AGTTGG A C AAA AAG AAC AAA 
ATGTTGTITrCAAAGTTATGACTCCTGAGATTGGTGTACCATTTGTAACTGAA 

TAAAAAACAAGCGTTAGCTAAAGAATTTATTGATTGGTITGGTCAATCAGAAATTCAAGTAGAATATAGT 
CTTTGGATCTATTCCTGCAAATAAAGATGCCCTCAAAGATCTACCrGAAGATACGAAGAAATT^ 
AAACCACAAAATATTGACTGGGAAGCTGTTGGAAAGCATTTGGATGAATGGGTAGAAAAAGCTGAATTAGAATAC
GTACAATAA 

4346.2 

ATG ATT AA ATTTG AT AAT ATTC AAATT AAAT ATG GTG ATTTTGTTG C AATTG AT AATCTG AATTT AG AT AT AC ATG 
AAGGGGAATTTTTTACATTTCTTGGGCCnTCAGGATGTGGTAAATCAACTACTTTGAGAGCA^

AGATCCATCATCAGGAAGTATTGAAGTTAATGGAACAGATGTCACTCATTTGGAACCTGAAAAGCGTGGAATTGG 
TATTGTATTTCAATCTTATGCGCTATTTCCAACTATGACTGTTTTrGATAATATTG 

AGGTAGCTCCAGATGTTATTAAAGCTAAAGTATCAGCAGTGGCAGCAAAAATTAAGATCTCTGATCAACAGTTAC 
AGCGTAATGTATCAGAATTATCTGGGGGTCAACAACAAAGGGTAGCATTGGCTCGTGCTCTGGTTCTTGAACCTAA 

AATTCTTTGTCTAGATGAACCATTGTCAAACCTTGACGCAAAATTACGTGTAGATTTGAGAAAAGAGTTGAAAAGA

CTTC AAAAAG AGTT AG GT ATT ACT ACTTT AT ATGTT ACTC ATG ATC AAG AGG AAGCCTTG ACTTT ATCTG AT AG AA 
TTGCAGTCTTTAACAATGGATACATCGAACAGGTCGGTACACCAGTAGAGATTTATCATAATTCTCAAACTGAATT 
TGTATGTGATTTTATTGGAGATATTAATGTTTTGACCGATGAAACAGTCCACGAAGTATTATTGAAAAATA CAAGC 
GTTTTCTT AG AGG AT A AA AAAGG AT AC ATTCG ATT AG AG A A AGTTCG ATTC A AT CGTG A AACTG AAC AAG ATTTT A 

TTCTAAAAGGGACAATTATTGATGTTGAGTTTTCTGGAGTTACAATTCACTATACAATAAAAGTTTCTGAAAGTCA
GATTCTTAATGTAACAAGTATTGATAGTCAGGCTC
CCATCAGACGTTCTGCAATTTTAA 



HJW.J 

ATGCGTCATAAATTAAATTTAAAAGATTGGCTTATTCGTTTAGGGTTAATCTGGTrCT^

TCCAAACTTTGATCTAGTAGTGAATGTATTTGTAAAAGGAGGAGAATTTTCCCTTGATGCTGTACATCGTC 
AAATCTCAGAGGGCACTTCAGAGTATTATGAACAGTTTTAAGTTAGCATTTTCACTCATT 
TAGGTATTCTTTCn'GrrCTATTTACAGAGTACTTTGATATTAAAGGTGCTAAAAT^ 
TCTTTAATTTATGGAGGAGTGGTTTTAGCGACTGGATATAAATTTGTCTATGGTCCTTATGGATT 

TTTACAAAATGTTATCCCTTCTTTAGACCCTAACTGGTTTATTGGGTATGGTGCAGTCTTA^

CAGGAACTGCTAATCATACATTGTTmAACAAATACAATTCGAAGCGTTGACTATCACACTATTGAGG 
AAATATGGGAGCAAAACCATTTACTGTTTTCCGAAAAGTAGTGTTACCAACCnTAATTCCAACTCT 
ACTATTATGG l ' lUlUVr TAGTGGTTTATCTGCAGTAGCAGCACCCATGATTGTTGGTGGTAAAGAATT TCAAA CTAT 
AAATCCAATGATTATTACATTTGCAGGGATGGGGAATTCTCGTGATTTAGCTGCCCTACTTGCAATTAT^ 

ATTGCAACTACAATTTTGCTTACTATCATGAA^

AAGCGCCTCTTAAAAAACAAAAAATTGCGTCTAAGCCrrGGAATATCATTGCTCACATTGTAGCATA 

CACAGTTTTCATGCTTCCACTAATTTTTATAGTATTATACrrCATTTACAGATCCAGTTGCAAT^ 

TAACATTATCAAACTTTACTTTAGAAAATTATCGCTTATTCIT^ 

AGCTTT ATTT ATTCT ATT ATTG CTGCG AC A AC AGC AAC AATTCTCG C AGTTGT ATTTG CTCGT GTTGTC AG AA AAC A 
TAAATCTCGTTTTGATTTCTTATTTGAATATGGTGCTCTACnTCCITGGTT
GTTTATTATTTACTTTTAATCAGCCACAATTTCTTGTCTTGAATCAGAT^^ 

ATTG C AT AT AT AGTTGT AAAA ATCCC ATTTTCTT AT AG A ATG GTA CGTG CTATTTT ATTT AGTGTTG A TG ATG AG AT 
GG AAG ATG C AG C A AG A AGT ATGG GTG CTTC ACCTTTTT AT A CT ATG ATG A AGGTT ATC ATTCC ATTT ATTTT ACCG 
GTTGTTCTCTCTGTT ATTG CTTTAj\ACTTTAACTCTTT ATT AACTG 
CTAGCTCAACCATTAGGTATTACGATTCGATCTGCAGGTGATGAAACAGCAACATCTAATGCACAAGCTCTGGTAT

TTGTTTATACAATTGTTCTGATGATTATTTCTGGAACGGTATTAT 
GAAATAA

Table 2



MEELVTLDCLFIDRTKIEANANKYSFVWKKTTEKFSAKLQEQIQVYFQEEITPLLIKYAMFDFCKQKRGYKESAKNLANW 
HYNDKEDSYTHPDGWYYRFHHTKYQKTQTDFQQEIKVYYADEPESAPQKGLYMNERYQNLKAKECQALLSPQGRQIF 
AQRKIDVEPVFGQIKASLGYKRCNLRGKRQVR1DMGLVLMANNLLKYSKMKZ

MGKGHWNRKRVYSIRKFAVGACSVMIGTCAVLLGGNIAGESVVYADETLITHTAEKPKEEKMIVEEKADKALETKNIV 
ERTEQSEPSSTEAIASEKKEDEAVTPKEEKVSAKPEEKAPRIESQASNQEKPLKEDAKAVTNEEVNQMIEDRKVDFNQN 
WYFKLNANSKEAIKPDADVSTWKKLDLPYDWSIFNDFDHESPAQNEGGQLNGGEAWYRKTFKLDEKDUOCNVRLTF 

DGVYMDSQVYVNGQLVGHYPNGYNQFSYDITKYLQKDGRENVIAVHAVNKQPSSRWYSGSGIYRDVTLQVTDKVHV
EKNGTTILTPKLEEQQHGKVETHVTSKIVNTDDKDHELVAEYQIVERGGHAVTGLVRTASRTLKAHESTSLDAILEVER 
PKLWTVLNDKPALYELITRVYRDGQLVDAKKDLFGYRYYHWTPNEGFSLNGERIKFHGVSLHHDHGALGAEENYKAE 
YRRLKQMKEMGVNSIRTTHNPASEQTLQIAAELGLLVQEEAFDTWYGGKKPYDYGRFFEKDATHPEARKGEKWSDFD 
LRTMVERGKNNPAIFMWSIGNEIGEANGDAHSLATVKRLVKVIKDVDKTRYVTMGADKFRFGNGSGGHEKIADELDA 

VGFNYSEDNYKALRAKHPKWLIYGSETSSATRTRGSYYRPERELKHSNGPERNYEQSDYGNDRVGWGKTATASWTFD
RDNAGYAGQFIWTGTDYIGEPTPWHNQNQTPVKSSYFGIVDTAGIPKHDFYLYQSQWVSVKKKPMVHLLPHWNWENK 
ELASKVADSEGKIPVRAYSNASSVELFLNGKSLGLKTFNKKQTSDGRTYQEGANANELYLEWKVAYQPGTLEAIARDES 
GKEIARDKITTAGKPAAVRLIKEDHAIAADGKDLTYIYYEIVDSQGNVVPTANNLVRFQLHGQGQLVGVDNGEQASRER 
YKAQADGSWIRKAFNGKGVAIVKSTEQAGKFTLTAHSDLLKSNQVTVFTGKKEGQEKTVLGTEVPKVQTIIGEAPEMPT 

TVPFVYSDGSRAERPVTWSSVDVSKPGIVTVKGMADGREVEARVEVIALKSELPVVKR1APNTDLNSVDKSVSYVLIDGS

VEEYEVDKWEIAEEDKAKLAIPGSRIQATGYLEGQPIHATLVVEEGNPAAPAVPTVTVGGEAVTGLTSQKPMQYRTLA 
YGAKLPEVTASAKNAAVTVLQASAANGMRASIFIQPKDGGPLQTYAIQFLEEAPKIAHLSLQVEKADSLKEDQTVKLSV 
RAHYQDGTQAVLPADKVTFSTSGEGEVAIRKGMLELHKPGAVTLNAEYEGAKDQVELTIQA^EKKIAQSIRPVNVVT 
DLHQEPSLPATVTVEYDKGFPKTHKVTWQAIPKEKLDSYQTFEVLGKVEGIDLEARAKVSVEGIVSVEEVSVTTPIAEAP 

QLPESVRTYDSNGHVSSAKVAWDAIRPEQYAKEGVFTVNGRLEGTQLTTKLHVRVSAQTEQGANISDQWTGSELPLAF
ASDSNPSDPVSNVNDKLISYNNQPANRWTNWNRTNPEASVGVLFGDSGILSKRSVDNLSVGFHEDHGVGVPKSYVIEY 
YVGKTVPTAPKNPSFVGNEDHVFNDSANWKPVTNLKAPAQLKAGEMNHFSFDKVETYAVRIRMVKADNKRGTSITEV 
QIFAKQVAAAKQGQTRIQVDGKDLANFNPDLTDYYLESVDGKVPAVTASVSNNGLATVVPSVREGEPVRVIAKAENGD 
IU3EYRLHFTKDKSLLSHKPVAAVKQARLLQVGQALELPTKVPVYFTGKDGYETKDLTVEWEEVPAENLTKAGQFTVR 

GRVLGSNLVAErrVRVTDKLGETLSDNPNYDENSNQAFASATNDIDKNSHDRVDYLNDGDHSENRRWTNWSPTPSSNP
EVSAGVIFRENGKIVERTVTQGKVQFFADSGTDAPSKLVLERYVGPEFEVPTYYSNYQAYDADHPFNNPENWEAVPYR 
ADKDIAAGDEINVTFKAIKAKAMRWRMERKADKSGVAMIEMTFLAPSELPQESTQSKILVDGKELADFAENRQDYQIT 
YKGQRPKVSVEENNQVASTVVDSGEDSFPVLVRLVSESGKQVKEYR1HLTKEKPVSEKTVAAVQEDLPKIEFVEKDLAY 
KTVEKKDSTLYLGETRVEQEGKVGKERIFTAINPDGSKEEKLREVVEVPTDRIVLVGTKPVAQEAKKPQVSEKADTKPID 

SSEASQTNKAQLPSTGSAASQAAVAAGLTLLGLSAGLVVTKGKKEDZ

MKIMKKKYWTLAILFFCLFNNSVTAQEIPKNLDGNITHTQTSESFSESDEKQVDYSNKNQEEVDQNKFRIQIDKTELFVT 
TDKHLEKNCCKLELEPQINNDIVNSESNNLLGEDNLDNKIKENVSHLDNRGGNIEHDKDNLESSIVRKYEWDIDKVTGG 
GESYKLYSKSNSKVSIAILDSGVDLQNTGLLKNLSNHSKNYVPNKGYLGKEEGEEGIISDIQDRLGHGTAVVAQIVGDDN 
INGVNPHVNINVYRIFGKSSASPDWIVKAIFDAVDDGNDIINLSTGQYLMIDGEYEDGTNDFETFLKYKKAIDYANQKGV
IIVAALGNDSLNVSNQSDLLKLISSRKKVRKPGLVVDVPSYFSSTISVGGIDRLGNLSDFSNKGDSDAIYAPAGSTLSLSEL 
GLNNFINAEKYKEDWIFSATLGGYTYLYGNSFAAPKVSGAIAMHDKYKLKDQPYNYMFVKKFWKKHYQZ 

MKKTWKVFLTLVTALVAVVLVACGQGTASKDNKEAELKKVDFILDWTPNTNHTGLYVAKEKGYFKEAGVDVDLKLP 
PEESSSDLVINGKAPFAVYFQDYMAKKLEKGAGITAVAAIVEHNTSGIISRKSDNVSSPKDLVGKKYGTWNDPTELAML
KTLVESQGGDFEKVEKVPNNDSNSITPLANGVFDTAWIYYGWDGILAKSQGVDANFMYLKDYVKEFDYYSPVHANND 
YLKDNKEEARKVIQAIKKGYQYAMEHPEEAADILIKNAPELKEKRDFVIESQKYLSKEYASDKEKWGQFDAARWNAFY 
KWDKENGILKEDLTDKGFTNEFVKZ 

MKRTWRNSFVTNLl^PFMIGNIEIPNRTVLAPMAGVTNSAFRTIAKELGAGLVVMEMVSDKGIQYNNEKTLHMLHIDE
GENPVSIQLFGSDEDSLARAAEFIQENTKTDIVDINMGCPVNKIVKNEAGAMWLKDPDKIYSIINKVQSVLDIPLTVKMR 
TGWADPSLAVENALAAEAAGVSALAMHGRTREQMYTGHADLETLYKVAQALTKIPFIANGDIRTVQEAKQRIEEVGA 
DAVMIGRAAMGNPYLFNQINHYFETGEILPDLTFEDKMKIAYEHLKRLINLKGENVAVREFRGLAPHYLRGTSGAAKL 

RGAISQASTLAEIETLLQLEKAZ

MIKNPKLLTKSFLRSFAILGGVGLVIHIAIYLTFPFYYIQLEGEKFNESARVFTEYLKTKTSDEIPSLLQSYSKSLTISAHLK 
RDIVDKRLPLVHDLDIKDGKLSNYIVMLDMSVSTADGKQVTVQFVHGVDVYKEAKNILLLYLPYTFLVTIAFSFVFSYF 
YTKRLLNPLFYISEVTSKMQDLDDNIRFDESRKDEVGEVGKQINGMYEHLLKVIYELESRNEQIVKLQNQKVSFVRGAS 
HELKTPLASLR1ILENMQHNIGDYKDHPKYIAKSINKIDQMSHLLEEVLESSKFQEWTECRETTVKPVLVDILSRYQELAH 
SIGVTIENQLTDATRVVMSLRALDKVLTNLISNAIKYSDKNGRVIISEQDGYLSIKNTCAPLSDQELEHLFDIFYHSQIVTD
KDESSGLGLYIVNNILESYQMDYSFLPYEHGMEFK1SLZ 

MYLGDLMEKAECGQFSILSFLLQESQTTVKAVMEETGFSKATLTKYVTLLNDKALDSGLELAIHSEDENLRLSIGAATK 
GRDIRSLFLESAVKYQILVYLLYHQQFLAHQLAQELVISEATLGRHLAGLNQILSEFDLSIQNGRWRGPEHQIHYFYFCL
FRKVWSSQEWEGHMQKPERKQEIANLEEICGASLSAGQKLDLVLWAHISQQRLRVNACQFQVIEEKMRGYFDNIFYLR
LLRKVPSFFAGQHIPLGVEDGEMMIFFSFLLSHRILPLHTMEYILGFGGQLADLLTQLIQEMKKEELLGDYTEDHVTYEL 
SQLCAQVYLYKGYILQDRYKYQLENRHPYLLMEHDFKETAEEIFHALPAFQQGTDLDKKILWEWLQLIEYMAENGGQ 
HMRIGLDLTSGFLVFSRMAAILKRYLEYNRFITIEAYDPSRHYDLLVTNNPIHKKEQTPVYYLKNDLDMEDLVAIRQLLF 
TZ

MEFSKKTRELSUOCMQERTLDLLIIGGGrTGAGVALQAAASGLETGLIEMQDFAEGTSSRSTKLVHGGLRYLKQFDVEV 
VSDTVSERAVVQQIAPHIPKSDPMLLPVYDEDGATFSLFRLKVAMDLYDLLAGVSNTPAANKVLSKIXJVLERQPNLKK 
EGLVGGGVYLDFRNNDARLVIENIKRANQDGALUNHVKAEGFLFDESGKITGVVARDLLTDQVFEIKARLVINTTGPW 
SDKVRNI^NKGTQFSQMRPTKGVHLVVDSSKIKVSQPVYFDTGLGDGRMVFVLPRENKTYFGTTDTDYTGDLEHPKVT
QEDVDYLLGIVNNRFPESNITIDDIESSWAGLRPLIAGNSASDYNGGNNGTISDESFDNLIATVESYLSKEKTREDVESAV 
SKLESSTSEKHLDPSAVSRGSSLDRDDNGLLTLAGGKITDYRKMAEGAMERVVDILKAEFDRSFKLINSKTYPVSGGELN 
PANVDSEIEAFAQLGVSRGLDSKEAHYLANLYGSNAPKVFALAHSLEQAPGLSLADTLSLHYAMRNELTLSPVDFLLRR
TNHMLFMRDSLDSIVEPILDEMGRFYDWTEEE1CATYRADVEAALANNDLAELKNZ 


MMNELFGEFLGTLILILLGNGVVAGVVLPKTKSNSSGWIVITMGWGIAVAVAVFVSGKLSPAYLNPAVTIGVALKGGLP 

WASVLPYILAQFAGAMLGQILVWLQFKPHYEAEENAGNILATFSTGPAIKDTVSNLISEILGTFVLVLTIFALGLYDFQA 

GIGTFAVGTLIVGIGLSLGGTTGYALNPARDLGPR1MHSILPIPNKGDGDWSYAWIPVVGPVIGAALAVLVFSLFZ 

MTKKKIERISVIHREKrLWLKWYFMRDKEQPKYSVLERKMFDAAKNQDMLAYQKYATIKQ[TDiRVQTSEADILEAVKE
VYVYNHMNVIGACQRILFISQSPAYDKLNKWFNIYSDLYFSVVPLPKMGVYHEMVGIZ 

MKNSNEAEMKLLYTDIRTSLTEILTREAEELVAAGKRVFYIAPNSLSFEKERAVLEYLSQQASFSITVTRFAQMARYLVL 
NDLPAKTTLDDIGLGLAFYKCLAELDPKDLRVYGAIKQDPQLIQQLIELYHEMTKSQMSFLDLENLTDEDKRADLLLIF 

EKVTAYLNQGQLAQESQLSHLIEAIENDKVSSDFNQIALVIDGFTRFSAEEERVVDLLHGKGVEIVIGAYASKKAYTSPFS
EGNLYQASVKFLHHLASKYQTPAQDCSQTHEKMDSFDKASRLLESSYDFSELALDVDEKDRENLQIWSCLTQKEELEL 
VARSIRQKLHENSDLSYKHFRILLGDVASYQLSLKTIFDQYQIPFYLGRSEAMAHHPLTQFVESILALKRYRFRQEDLINL 
LRTDLYTDLSQSDIDAFEQYIRYLGINGLPAFQQTFTKSHHGKFNLERLNVLRLRILAPLETLFASRKQKAEKLLQKWSV 
FLKEGAVTKQLQDLTTTLEAVEQERQAEVWKAFCHVLEQFATVFAGSQVSLEDFLALLHSGMSLSQYRTIPATVDTVL 

VQSYDLIAPLTADFVYAIGLTQDNLPKISQNTSLLTDEERQNLNQATEEGVQLLIASSENLKKNRYTMLSLVNSARKQLF
LSAPSLFNESESKESAYLQELIHFGFRRREKRMNHKGLSKEDMGSYHSLLSSLVAYHQQGEMSDTEQDLTFVKVLSRVI 
GKKLDQQGLENPAIPTSPSSKTLAKDTLQALYPAKQEFYLSTSGLTEFYRNEYSYFLRYVLGLQEELRLHPDARSHGNFL 
HRIFERALQLPNEDSFDQRLEQAIQETSQEREFEAIYQESLEAQFTKEVLLDVARTTGHILRHNPAIETIKEEANFGGKDQ 
AFIQLDNGRSVFVRGKVDRIDRLKANGAIGVVDYKSSLTQFQFPHFFNGLNSQLPTYLAALKREGEQNFFGAMYLEMA 

EPVQSLMAVKSLAGAVVEASKSMKYQGLFLEKESSYLGEFYNKNKANQLTDEEFQLLLDYNAYLYKKAAEKILAGRF
AINPYTENGRSIAPYVQQHQAITGFEANYHLGQARFLEKLDLADGKRLVGEKLKQAWLEKIREELNRZ 



MKLIPFLSEEEIQKLQEAEANSSKEQKKTAEQIEAIYTSAQNILVSASAGSGKTFVMAERILDQLARGVEISQLFISTFTVK 

AATELKERLEKKISKKIQETDDVDLKQHLGRQLADLPNAAIGTMDSFTQKFLGKHGYLLDIAPNFRiLQNQSEQLILENE
VFHEVFEAHYQGKQKETFSHLLKNFAGRGKDERGLRQQVYKIYDFLQSTSNPQKWLSESFLKGFEKADFTSEKEKLTE 
QIKQALWDLESFFRYHLDNDAKEFAKAAYLENVQLILDEIGSLNQESDSQAYQAVLARVVAISKEKNGRALTNASRKA 
DLKPLADAYNEERKTQFAKLGQLSDQIAILDYQERYHGDTWKLAKTFQSFMSDFVEAYRQRKRQENAFEFADISHYTIE 
ILENFPQVRESYQERFHEVMVDEYQDTNHIQERMLELLSNGHNRFMVGDIKQSIYRFRQADPQIFNEKFQRYAQNPQEG 

RLIILKENFRSSSEVLSATNDVFERLMDQEVGEINYDNKHQLVFAhfTKLTPNPDNKAAFLLYDKDDTGEEEESQTETKL
TGEMRLVIKEILKLHQEKGVAFKEIALLTSSRSRNDQILLALSEYGIPVKTDGEQNNYLQSLEVQVMLDTLRVIHNPLQD 
YALVALMKSPMFGFDEDELARLSLQKAEDKVHENLYEKLVNAQKMASSQKGLIHTALAEKLKQFMDILASWRLYAKT 
HSLYDLIWKIYNDRFYYDYVGALPNGPARQANLYALALRADQFEKSNFKGLSRFIRMIDQVLEAQHDLASVAVAPPKD 
AVELMTIHKSKGLEFPYVFILNMDQDFNKQDSMSEVILSRQNGLGVKYlAKMETGAVEDHYPKTIKLSrPSLTYRQNEEE 

LQLASYSEQMRLLYVAMTRAEKKLYLVGKGSREKLESKEYPAAKNGKLNSNTRLQARNFQDWLWAISKVFTKDKLNF
SYRFIGEDQLTREAIGELETKSPLQDSSQADNRQSDTIKEALEMLKEVEVYNTLHRAAIELPSVQTPSQKKFYEPVMDM 
EGVEIAGQGQSVGKKISFDLPDFSTKEKVTGAEIGSATHELMQR1DLSQQLTLASLTETLKQVQTSQAVRDKINLDKILAF 
FDTVLGQEILANTDHLYREQPFSMLKRDQKSQEDFVVRGILDGYLLYENKIVLFDYKTDRYDEPSQLVDRYRGQLALY 

EEALSRAYSIENIEKYLILLGKDEVQVVKVZ

MELARHAESLGVDAIATIPPIYFRLPEYSVAKYWNDISSAAPNTDYVIYNIPQLAGVALTPSLYTEMLKNPRVIGVKNSS 
MPVQDIQTFVSLGGEDHIVFNGPDEQFLGGRLMGARAGIGGTYGAMPELFLKLNQLIADKDLETARELQYAINAIIGKL 
TSAHGNMYGVIKEVLKINEGLNIGSVRSPLTPVTEEDRPVVEAAAALIRETKERFLZ 


MYKTKCLREKLVLFLKIFFPILIYQFANYSASFVDTAMTGQYNTMDLAGVSMATSIWNPFFTFLTGIVSALVPIIGHHLG 
RGKKEEVASDFYQFIYLALGLSVVLLGMVLFLAPIILNHIGLEAAVAAVAVRYLWFLSIGIIPLLLFSVIRSLLDSLGLTKL 
SMYLMLLLLPLNSGFNYLLIYGAFGVPELGGAGAGLGTSLAYWVLLGISVLVLFKQEKLKALHLEKRIPLNMDKIKEGV 
RLGLPIGGTVFAEVAIFSVVGLIMAKFSPLIIASHQSAMNFSSLMYAFPMSISSAMAIVVSYEVGAKRFDDAKTYIGLGRW
TALIFAAFTLTFLYIFRGNVASLYGNDPKFIDLTVRFLTYSLFFQLADTFAAPLQGILRGYKDTVIPFYLGLLGYWGVAIP
VYAI2 

MSTLAKIEALLFVAGEDGIRVRQLAELLSLPPTGIQQSLGKLAQKYEKDPDSSLALIETSGAYRLVTKPQFAEILKEYSKA 
PINQSLSRAALETLSIIAYKQPITRIEIDAIRGVNSSGALAKLQAFDLIKEDGKKEVLGRPNLYVTTDYFLDYMGINHLEEL
PVIDELEIQAQESQLFGERIEEDENQZ 

MDTMISRFFRHLFEALKSLKRNGWMTVAAVSSVMrTLTLVAIFASVIFNTAKLATDIENNVRVVVYIRKDVEDNSQTIE 
KEGQTVTNNDYHKVYDSLKNMSTVKSVTFSSKEEQYEKLTEIMGDNWKIFEGDANPLYDAYIVEANTPNDVKTIAEDA 
KKIEGVSEVQDGGANTERLFKLASFIRVWGLGIAALLIFIAVFLISNTIR1TIISRSREIQIMRLVGAKNSYIRGPFLLEGAFIG
LLGAIAPSVLVFIVYQIVYQSVNKSLVGQNLSMISPDLFSPLMIALLFVIGVFIGSLGSGISMRRFLKIZ 

MKKVRFIFLALLFFLASPEGAMASDGTWQGKQYLKEDGSQAANEWVFDTHYQSWFYIKADANYAENEWLKQGDDYF 
YLKSGGYMAKSEWVEDKGAFYYLDQDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDAWFYtKADGQHAEKEW 

LQIKGKDYYFKSGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWIFENGHYYYLKSGGY
MAANEWIWDKESWFYLKFDGKMAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVYD 
SHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVYDSHSQAWYYFKSGGYMAKNETVDGYQLGSDGK 
WUjGKTTNENAAYYQVVPVTANVYDSDGEKLSYISQGSVVWLDKDRKSDDKRLAITISGLSGYMKTEDLQALDASKD 
FIPYYESDGHRFYHYVAQNASIPVASHLSDMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAEELDKVFSLLNI 

NNSLLENKGATFKEAEEHYHINALYLLAHSALESNWGRSKIAKDKNNFFGITAYDTTPYLSAKTFDDVDKGILGATKWI
KENYIDRGRTFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKDZ 

MKKVLQKYWAWAFVVIPLLLQAIFFYVPMFQGAFYSFTNWTGLTYNYKFVGLNNFKLLFMDPKFMNAIGFTAIIAIAM 
VVGEIALGIFIARVLNSKIKGQTFFRAWFFFPAVLSGLTVALIFKQVFNYGLPAIGNALHIEFFQTSLLGTKWGAIFAAVF 
VLLWQGVAMPIIIFLAGLQSIPTEITEAARIDGATSKQVFWNIELPYLLPSVSMVFILALKGGLTAFDQVFAMTGGGPNN
ATTSLGLLVYNYAFKNNQFGYANAIAVILFFLIVVISIIQLRVSKKFEIZ 

MMKQDERKALIGKYILLILGSVLILVPLLATLFSSFKPTKDIVDNFFGFPTNFTWDNFSRLLADGIGGYYWNSVVITVLSL 
LAVMinPMAAYSURNMSKRKAFTIMYTLLILGIFVPFQVIMIPITVMMSKLGLANn-FGULLYLTYAIPQTLFLYVGYlKI 
SIPESLDEAAEIDGANQFTTYFR1IFPMMKPMHATTMIINALWFWNDFMLPLLVLNRDSKMWTLPLFQYNYAGQYFND
YGPSFASYVVGIISrnVYLFFQRHIISGMSNGAVKZ 

MKSILQKMGEHPMLLLFI^YSTVISILAQNWMGLVASVGMFLFTIFFLHYQSILSHKFFRLILQFVLFGSVLSAAFASLEH 
FQIVKKFNYAFLSPNMQVWHQNRAEVTFFNPNYYGIICCFCIMIAFYLFTTTKLNWLKVFCVIAGFVNLFGLNFTQNRT 
AFPAHAGAIIYLFTTIKNWKAFWLSIGVFAIGLSFLFSSDLGVRMGTLDSSMEERJSIWDAGMALFKQNPFWGEGPLTYM
NSYPRIHAPYHEHAHSLYIDTIl^YGIVGTILLVl^SVAPVRLMMDMSQESGKRPIIGLYLSFLTVVAVHGIFDLALFWIQS 
GFIFLLVMCSIPLEHRMLVSDMTDZ 

MSKMDVQKIIAPMMKFVNMRGIIALKDGMLAILPLTVVGSLFLIMGQLPFEGLNKSIASVFGANWTEPFMQVYSGTFAI 
MGLISCFSIAYSYAKNSGVEALPAGVLSVSAFFILLRSSYIPKQGEAIGDAISKVWFGGQGIIGAIIIGLVVGSIYTFFIKRKIV
KMPEQVPQAUKQFEAMIPAFVIFI^SMIVYILAKSLTNGGTFIEMIYSAIQVPLQGLTGSLYGAIGIAFFISFLWWFGVH 
GQSVVNGVVTALLLSNLDANKAMLASANLSLENGAHIVTQQFLDSFLILSGSGITFGLVVAMLFAAKSKQYQALGKVA 
AFPAIFNVNEPVVFGFPrVMNPVMFVPFILVPVLAAVIVYGAlATGFMQPFSGVTLPWSTPAlLSGFLVGGWQGVITQLVI 
LAMSTLVYFPFFKVQDRLAYQNEIKQSZ 


MKKKDLVDQLVSEIETGKVRTLG1YGHGASGKSTFAQELYQALDSTTVNLLETDPYITSGRHLVVPKDAPNQKVTASLP 
VAHELESLQRDILACRRVWMSZ 

MKKRYLVLTALLALSLAACSQEKTKNEDGETKTEQTAKADGTVGSKSQGAAQKKAEVVNKGDYYSIQGKYDEUVAN 
KHYPLSKDYNPGENPTAKAELVKLIKAMQEAGFPISDHYSGFRSYETQTKLYQDYVNQDGKAAADRYSARPGYSEHQT
GLAFDVIGTDGDLVTEEKAAQWLLDHAADYGFVVRYLKGKEKETGYMAEEWHLRYVGKEAKEIAASGLSLEEYYGF 
EGGDYVDZ 

MREPDFLNHFLKKGYFKKHAKAVLALSGGLDSMFLFKVLSTYQKELEIELILAHVNHKQRIESDWEEKELRKLAAEAE 
LPIYISNFSGEFSEARARNFRYDFFQEVMKKTGATALVTAHHADDQVETIFMRLIRGTRLRYLSGIKEKQVVGEIEIIRPFL
HFQKKDFPSIFHFEDTSNQENHYFRNRIRNSYLPELEKENPRFRDAILGIGNEILDYDLAIAELSNNINVEDLQQLFSYSES 
TQRVLLQTYLNRFPDLNLTKAQFAEVQQILKSKSQYRHPIKNGYELIKEYQQFQICKISPQADEKEDELVLHYQNQVAY 
QGYLFSFGLPLEGELIQQIPVSRETSIHIRHRKTGDVLIKNGHRKKLRRLFIDLKIPMEKRNSALIIEQFGEIVSILGIATNNL 
SKKTKNDIMNTVLYIEKIDRZ 


MRKFLIILLLPSFLTISKVVSTEKEVVYTSKEIYYLSQSDFGIYFREKLSSPMVYGEVPVYANEDLVVESGKLTPICTSFQIT 
EWRLNKQGIPVFKLSNHQFIAADKRFLYDQSEVTPTIKKVWLESDFKLYNSPYDLKEVKSSLSAYSQVSIDKTMFVEGRE 
FLHIDQAGWVAKESTSEEDNRMSKVQEMLSEKYQKDSFSIYVKQLTTGKEAGINQDEKMYAASVLKLSYLYYTQEKIN 
EGLYQLDTTVKYVSAVNDFPGSYKPEGSGSLPKKEDNKEYSLKDLITKVSKESDNVAHNLLGYYISNQSDATFKSKMSA 






IMGDDWDPKEKLISSKMAGKFMEAIYNQNGFVLESLTKTDFDSQRIAKGVSVKVAHKIGDADEFKHDTGVVYADSPFIL 
SIFTKNSDYDTISK1AKDVYEVLKZ 

MKKQNNGL1KNPFLWLLFIFFLVTGFQYFYSGNNSGGSQQINYTELVQEITDGNVKELTYQPNGSVIEVSGVYKNPKTSK 
EETGIQFFTPSVTKVEKFTSTILPADTTVSELQKLATDHKAEVTVKHESSSGIWINLLVSIVPFGILFFFLFSMMGNMGGG
NGI^PMSFGRSKAKAANKEDIKVRFSDVAGAEEEKQELVEVVEFLKDPKRFTKLGARIPAGVLLEGPPGTGKTLLAKA 
VAGEAGVPFFSISGSDFVEMFVGVGASRVRSLFEDAKKAAPAIIFIDEIDAVGRQRGVGLGGGNDEREQTLNQLLIEMDG 
FEGNEGIIVIAATNRSDVLDPALLRPGRFDRKVLVGRPDVKGREAILKVHAKNKPLAEDVDLKLVAQQTPGFVGADLEN 
VLNEAALVAARRNKSUDASDIDEAEDRVIAGPSKKDKTVSQKERELVAYHEAGHTIVGLVLSNARVVHKVTIVPRGRA 
GGYMIALPKEDQMLl^KEDMKEQLAGLMGGRVAEEIIFNVQTTGASNDFEQATQMARAMVTEYGMSEKLGPVQYEG
NHAMLGAQSPQKSiSEQTAYEIDEEVRSLLNEARNKAAEIIQSNRETHKLIAEALLKYETLDSTQIKALYETGKMPEAVE 
EESHALSYDEVKSKMNDEKZ

MKRSSLLVRMVISIFLVFLILLALVGTFYYQSSSSAIEATIEGNSQTTISQTSHFIQSYIKKLETTSTGLTQQTDVLAYAENP 
SQDKVEGIRDLFLTILKSDKDLKTVVLVTKSGQVISTDDSVQMKTSSDMMAEDWYQKAIHQGAMPVLTPARKSDSQW
VISVTQELVDAKGANLGVLRLDISYETLEAYLNQLQLGQQGFAFIINENHEFVYHPQHTVYSSSSKMEAMKPY[DTGQG 
YTPGHKSYVSQEKIAGTDWTVLGVSSLEKLDQVRSQLLWTLLGASVTSLLVCLCLVWFSLKRWIAPLKDLRETMLEIAS 
GAQNLRAKEVGAYELREVTRQFNAMLDQEDQLMVAIRSQEETTRQYQtQALSSQINPHFLYNTLDTIIWMAEFHDSQR 
WQVTKSLATYFRLALNQGKDLICLSDEINHVRQYLHQKQRYGDKLEYEINEhfVAFDNLVLPKLVLQPLVENALYHGI 
KEKEGQGHIKLSVQKQDSGLVIRIEDDGVGFQDAGDSSQSQLKRGGVGLQNVDQRLKLHFGANYHMKIDSRPQKGTKV
EIYINRIETSZ 

MKRSSLLVRMVISIFLVFLILUVLVGTFYYQSSSSAIEATIEGNSQTTISQTSHFIQSYIKKLETTSTGLTQQTDVLAYAENP 
SQDKVEGIRDLFLTILKSDKDLKTVVLVTKSGQVISTDDSVQMKTSSDMMAEDWYQKAIHQGAMPVLTPARKSDSQW 

VISVTQELVDAKGANLGVLRLDISYETLEAYLNQLQLGQQGFAFIINENHEFVYHPQHTVYSSSSKMEAMKPYIDTGQG
YTPGHKSYVSQEKIAGTDWTVLGVSSLEKLIXJVRSQLLWTLLGASVTSLLVCLCLVWFSLKRWIAPLKDLRETMLEIAS 
GAQNLRAKEVGAYELREVTRQFNAMLDQIDQLMVAIRSQEETTRQYQLQALSSQINPHFLYNTLDTIIWMAEFHDSQR 
WQVTKSLATYFRLALNQGKDLICLSDE1NHVRQYLFIQKQRYGDKLEYEINENVAFDNLVLPKLVLQPLVENALYHGI 
KEKEGQGHIKLSVQKQDSGLVIRIEDDGVGFQDAGDSSQSQLKRGGVGLQNVDQRLKLHFGANYHMKIDSRPQKGTKV 

EIYINRIETSZ

MFFKLLREALKVKQVRSKILFTIFIVLVFRIGTSITVPGVNANSLNALSGLSFLNMLSLVSGNALKNFSIFALGVSPYITASI 
VVQLLQMDILPKFVEWGKQGEVGRRKLNQATRYIALVLAFVQSIGITAGFNTLAGAQLIKTALTPQVFtTIGIILTAGSMI 
VTWLGEQITDKGYGNGVSMUFAGIVSSIPEMIQGIYVDYFVNVPSSRrrSSIIFVIILirrVLLIIYFTTYVQQAEYKIPIQYTK 
VAQGAPSSSYLPLKVNPAGVIPVIFASSITAAPAAILQFLSATGHDWAWVRVAQEMLATTSPTGIAMYALUILFTFFYTF
VQINPEKAAETYKRVVPISMEFVLVKVQKNICLNFFVVLQLLVPSSLVZ 

MDIRQVTETIAMIEEQNFDIRTn*MGISLLDCIDPDINRAAEKIYQKrTTKAANLVAVGDEIAAELGIPIVNKRVSVTPISLIG 
AATDATDYVVLAKALDKAAKEIGVDFIGGFSALVQKGYQKGDEILINSIPRALAETDKVCSSVKIGSTKSGINMTAVAD 
MGRIIKETANLSDMGVAKLVVFANAVEDNPFMAGAFHGVGEADVIINVGVSGPGVVKRALEKVRGQSFDVVAETVKK
TAFKITRIGQLVGQMASERLGVEFGIVDLSLAPTPAVGDSVARVLEEMGLETVGTHGTTAALALLNDQVKKGGVMAC 
NQVGGLSGAFIPVSEDEGMIAAVQNGSLNLEKLEAMTAICSVGLDMIAIPEDTPAETIAAMIADEAAIGVINMKTTAVRII 
PKGKEGDMIEFGGLLGTAPVMKVNGASSVDFISRGGQIPAPIHSFKNZ

MTQIIDGKALAAKLQGQLAEKTAKLKEETGLVPGLVVILVGDNPASQVYVRNKERSALAAGFRSEVVRVPETITQEELL
DLIAKYNQDPAWHGILVQLPLPKHIDEEAVLLAIDPEKDVDGFHPLNMGRLWSGHPVMIPSTPAGIMEMFHEYGIDLEG 
KNAVVIGRSNIVGKPMAQLLLAKNATVTLTHSRTHNLSKVAAKADILVVAIGRAKFVTADFVKPGAVVIDVGMNRDEN 
GFCLCGDVDYEAVAPLASHITPVPGGVGPMTITMLMEQTYQAALRTLDRKZ 

MSKFNRIHLVVLDSVGIGAAPDANNFVNAGVPDGASDTLGHISKTVGLNVPNMAKIGLGNIPRETPLKTVAAESNPTGY
ATKLEEVSLGKDTMTGHWEIMGLNITEPFDTFWNGFPEEILTKIEEFSGRKVIREANKPYSGTAVIYDFGPRQMETGELII 
YTSADPVLQIAAHEDIIPLDELYRICEYARSITLERPALLGRIIARPYVGEPGNFTRTANRRDLAVSPFFPTVLDKLNEAGI 
DTYAVGKINDIFNGAGINHDMGHNKSNSHGIDTLLKTMGLAEFEKGFSFTNLVDFDALYGHRRNAHGYRDCLHEFDE 
RLPEIIAAMRENDLLLITADHGNDPTYAGTDHTREYIPLLAYSPAFKGNGLIPVGHFADISATVADNFGVETAMIGESFL 

DKLVZ

MFISISAGIVTFLLTLVEIPAFIQFYRKAQITGQQMHEDVKQHQAKAGTPTMGGLVFLITSVLVAFFFALFSSQFSNNVGM 
ILFILVLYGLVGFLDDFLKVFRKINEGLNPKQKLALQLLGGVIFYLFYERGGDILSVFGYPVHLGFFYIFFALFWLVGFSN 
AVNLTDGVDGLASISVVISLSAYGVIAYVQGQMDILLVILAMIGGLLGFFIFNHKPAKVFMGDVGSLALGGMLAAISMA 
LHQEWTLLIIGIVYVFETTSVMMQVSYFKLTGGKRIFRMTPVHHHFELGGLSGKGNPWSEWKVDFFFWGVGLLASLLT
LAILYLMZ 

LFKKNKDILNIALPAMGENFLQMLMGMVDSYLVAHLGLIAISGVSVAGNIITIYQAIFlALGAAISSVISKSrGQKDQSKLA 
YHVTEALKITLLLSFLLGFLSIFAGKEMIGLLGTERDVAESGGLYLSLVGGSIVLLGLMTSLGALIRATHNPRLPLYVSFL 
SNALNILFSSLAIFVLDMGIAGVAWGTIVSRLVGLVILWSQLKLPYGKPTFGLDKELLTLALPAAGERLMMRAGDVVIIA
LVVSFGTEAVAGNAIGEVLTQFNYMPAFGVATATVMLLARAVGEDDWKRVASLSKQTFWLSLFLMLPLSFSIYVLGVP
LTHLYTTDSLAVEASVLVTLFSLLGTPMTTGTVIYTAVWQGLGNARLPFYATSIGMWCIRIGTGYLMGIVLGWGLPGIW 
AGSLLDNGFRWLFLRYRYQRYMSLKGZ 

MQTQEKHSQAAVLGLQHLLAMYSGSILVPIMIATALGYSAEQLTYLISTDIFMCGVATFLQLQLNKYFGIGLPVVLGVA
FQSVAPLIMIGQSHGSGAMFGALIASGIYVVLVSGIFSKVANLFPSIVTGSVrTTIGLTLIPVAIGNMGNNVPEPTGQSLLLA 
An-VLHLLINIFTKGFIKSISILIGLVVGTAIAATMGLVDFSPVAVAPLVHVPTPLYFGMPTFEISSIVMMCIIATVSMVEST 
GVYLALSDrrKDPIDSTRLRNGYRAEGLAVLLGGIFNTFPYTGFSQNVGLVKLSGlKKRLPIYYAAGFLVLLGLLPKFGA 
LAQIIPSSVLGGAMLVMFGFVSIQGMQILARVDFANNEHNFLIAAVSIAAGVGLNNSNLFVSMPTAFQMFFSNGIVVASL 
LAIVLNAVLNHKKKZ

MKDRIKEYLQDKGKVTVNDLAQALGKDSSKDFRELIKTLSLMERKHQIRFEEDGSLTLEDCKKHEITUCGIFHAHKNGFG 
FVSLEGEEDDLFVGKNDVNYAIDGDTVEVVIKKVADRNKGTAAEAKIIDILEHSLTTVVGQIVLDQEKPKYAGYIRSKN 

QK1SQPIYVKKPALKLEGTEVLKVFEDKYPSKKHDFFVASVLDVVGHSTDVGIDVLEVLESMDIVSEFPEAVVKEAESVP
DAPSQKDMEGRLDLRDEITFTIDGADAKDLDDAVHIKALKNGNLEFGVHIADVSYYVTEGSALDKEALNRATSVYVTD 
RVVPMLPER^NGICSLNPQVDRLTQSAIMEIDKHGRVVNYTITQTVIKTSFRMTYSDVNDILAGDEEKRKEYHKIVSSIE 
LMAKLHETLENMRVKRGALNFDTNEAKILVDKQGKPVDIVLRQRGIAERMIESFMLMANETVAEHFSKLDLPFIYRIHE 
EPKAEKVQKFIDYASSFGLRIYGTASEISQEALQDIMRAVEGEPYADVLSMMLLRSMQQARYSEHNHGHYGLAADYYT 

HFTSPIRRYPDLLVHRMIRDYGRSKEIAEHFEQVIPEIATQSSNRERRAIEAEREVEAMKKAEYMEEYVGEEYDAVVSSIV
KFGLFVELPNTVEGLrHITNLPEFYHFNERDLTLRGEKSGITFRVGQQIRIRVERADKMTGEIDFSFVPSEFDVIEKGLKQS 
SRSGRGRDSNRRSDKKEDKRKSGRSNDKRKHSQKDKKKKGKKPFYKEVAKKGAKHGKGRGKGRRTKZ 

MGTTGFTIIDLIILIVYLLAVLVAGIYFSKKEMKGFCEFFKGDGSVPWYVTSVSIFATMLSPISFLGLAGSSYAGSWILWFA
QLGMVVAIPLTIRFILPIFARIDIDTAYDYLDKRFNSKALRIISALLFHYQLGRMSIIMYLPSAGLSVLTGIDINILIILMGVV
AIVYSYTGGLKSVLWTDFIQGVIUSGVVLALFVLIANIKGGFGAVAETLANGKFLAANEKLFDPNLLSNSIFLIVMGSGF 
TII^SYASSQDLVQRFTTTQNIKKLNKMLFTNGVLSLATATVFYLIGTGLYVFYQVQNADSAASNIPQDQIFMYFIAYQL 
PVGITGLILAAIYAASQSTISTGLNSVATSWTLDIQDVISKNMSDNRRTKIAQFVSLAVGLFSIGVSIVMAHSDIKSAYEWF 
NSFMGLVLGLLGGVFILGFVSKKANKQGAYAALIVSTIVMVFIKYFLPPTAVSYWAYSLISISVSVVSGYIVSVLTGNKVS 
APKYTTIHDITEIKADSSWEVRHZ

MKFSKKYlAAGSAVrVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 
TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQK 
QEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNM 
QPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHF
IPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEVVSSLGSLSSNPSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGD 
HFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEEDGYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIK 
VRKNIZ 

MKKRAIVAVIVLLLIGLDQLVKSYIVQQIPLGEVRSWIPNFVSLTYLQNRGAAFSILQDQQLLFAVrrLVVVIGAIWYLHK
HMEDSFWMVLGLTLIIAGGLGNFIDRVSQGFVVDMFHLDFINFAIFNVADSYLTVGVIILLIAMLKEEINGNZ 

MNTNLASFIVGLIIDENDRFYFVQKDGQTYALAKEEGQHTVGDTVKGFAYTDMKQKLRLTTLEVTATQDQFGWGRVT 
EVRKDLGVFVDTGLPDKEIVVSLDILPELKELWPKKGDQLYIRLEVDKKDRIWGLLAYQEDFQRLARPAYNNMQNQN 
WPAIVYRLKLSGTFVYLPENNMLGFIHPSERYAEPRLGQVLDARVIGFREVDRTLNLSLKPRSFEMLENDAQMILTYLE
SNGGFMTLNDKSSPDDIKATFGISKGQFKKALGGLMKAGKIKQDQFGTELIZ 

MKDVSLFLLKKVFKSRLNWIVLALFVSVLGVTFYLNSQTANSHSLESRLESRIAANERAINENEEKLSQMSDTSSEEYQF 
AKNNLDVQKNLLTRKTEILTLLKEGRWKEAYYLQWQDEEKNYEFVSNDPTASPGLKMGVDRERKIYQALYPLNIKAH 
TLEFPTHGIDQIVWILEVIIPSLFVVAIIFMLTQLFAERYQNHLDTAHLYPVSKVTFAISSLGVGVGYVTVLFIGICGFSFLV
GSLISGFGQLDYPYPIYSLVNQEVTIGKIQDVLFPGLLLAFLAFIVIVEVVYLIAYFFKQKMPVLFLSLIGIVGLLFGIQTIQP 
LQRIAHLIPFTYLRSVEILSGRLPKQIDNVDLNWSMGMVLLPCLUFLLLGILFIERWGSSQKKEFFNRFZ 

MMKFILDIVSTPAILVALIAILGLVLQKKKLPDHKGGIKTFVGFLVVSGGAGIVQNSLNPFGTMFEHAFHLSGVVPNNEAI 
VAVALTTYGSATAMIMFAGMVFNILIARFTRFKYIFLTGHHTLYMACMIAVILSVAGFTSLPLILLGGLALGIIMSISPAF
VQKYMVQLTGNDKVALGHFSSLGYWLSGFTGSLIGDKSKSTEDIKFPKSLAFLRDSTVSITLSMAVIYIIVAIFAGSEYIEK 
EISSGTSGLVYALQLAGQFAAGVFVILAGVRLILGEIVPAFKGISERLVPNSKPALDCPIVYTYAPNAVLIGFISSFVGGLVS 
MVIMIASGTVVILPGVVPHFFCGATAGVIGNASGGVRGATIGAFLQGILISFLPVFLMPVLGGLGFQGSTFSDADFGLSGII 
LGMLNQFGSQAGIVIGLVLILAVMFGVSFIKKPSATEEZ 


MIKTFLSALSVILFSIPIITYSFFPSSNLNIWLSTQPILAQIYAFPLATATMAAILSFLFFFLSFYKKNKQIRFYSGILLLLSLIL 
LLFGTDKTLSSASNKTKTLKLVTWNVANQIEAQHIERIFSHFDADMAIFPELATNIRGEQENQRIKLLFHQVGLSMANYD 
IFTSPPTNSGIAPVTVIVKKSYGFYTEAKTFHTTRFGTIVLHSRKQNIPDIIALHTAPPLPGLMEIWKQDLNIIHNQLASKYP 
KAIIAGDFNATMRHGALAKISSHRDALNALPPFERGTWNSQSPKLFNATIDHILLPKNHYYVKDLDIVSFQNSDHRCIFT 
EITFZ
MNPIQRSWAYVSRKRLRSFILFLILLVLLAGISACLTLMKSNCT

ENVSPELETVAKUCDKEAVTGEQSVERDDLSAADNNLVSLTALEDSSFCDVTFTSSAFNLKEGRHLQKGDSKKILIHEEL 
AKKNGLSLHDKIGLDAGQSESGKGQTVEFEIIGIFSGKKQEKFTGLSSDFSENQVFTDVESSQTLLGNSEAQVSAARFYVE 
NPKEMDGLMKQVENLALENQGYQVEKENfKAFEQIKDSVATFQTFLTIFLYGMLIAGAGALILVLSLWLRERVYEVGIL
LALGKGKSSIFUJFCLEVVLVSLGALLPAFVAGNAnTYLLQTLLASGDQASLQDTLAKASSLSTSILSFAESYVFLVLLS 
CLSVALCFLFLFRKSPKEILSSISZ 

MLHNAFAYVTRKFFKSIVIFUILLMASLSLVGLSIKGATAKASQETFKNn-NSFSMQINRRVNQGTPRGAGNIKGEDIKKI 
TENKAIESYVKRINAIGDLTGYDLIETPETKKNLTADRAKRFGSSLMITGVNDSSKEDKFVSGSYKLVEGEHLTNDDKDK
ILLHKDLAAKHGWKVGDKVKLDSNIYDADNEKGAKETVEVTIKGLFDGHNKSAVTYSQELYENTAITDIHTAAKLYGY 
TEDTAIYGDATFFVTADKNLDDVMKELNGISGINWKSYTLVKSSSNYPALEQSISGMYKMANLLFWGSLSFSVLLLALL 
LSLWINARRKEVGILLSIGLKQASILGQFITESIUAIPALVSAYFLANCT^ 
GAEVDGFSKTLSSLDISIQTSDFIIIFVLALVLVVLVMALASSNLLRKQPKELLLDGEZ 

MSQDKQMKAVSPLLQRVINISSIVGGVGSLIFCIWAYQAGILQSKETLSAFIQQAGIWGPPLFIFLQILQTVVPIIPGALTSV 
AGVFIYGHIIGTIYNYIGrVlGCAIIFYLVRLYGAAFVQSVVSKRTYDKYIDWLDKGNRFDRFFIFMMIWPISPADFLCMLA 
ALTKMSFKRYMTHILTKPFTLVVYTYGLTYHDFFWQMLZ 

MRNMWVVIKETYLRHVESWSFFFMVISPFLFLGISVGIGHLQGSSMAKNNKVAVVTTVPSVAEGLKNVNGVNFDYKDE
ASAKEAIKEEKLKGYLTIDQEDSVLKAVYHGETSLENGIKFEVTGTLNELQNQLNRSTASLSQEQEKRLAQTIQFTEKIDE 
AKE^KFIQTlAAGALGFFLYMILrTYAGVTAQEVASEKGTKIMEVVFSSIRASHYFYARMMALFLVILTHIGIYVVGGL 
AAVLLFKDLPFl^QSGILDHLGDAISLhTTLLFILISLFMYVVLAAFLGSMVSRPEDSGKALSPLMILIMGGFFGVTALGAA 
GDNLLLKIGSYIPFISTFFMPFRTINDYAGGAEAWISLAITVIFAVVATGFIGRMYASLVLQTDDLGIWKTFKRALSYKZ 


MTETIKLMKAHTSVRRFKEQEIPQVDLNEILTAAQMASSWKNFQSYSVIVVRSQEKKDALYELVPQEAIRQSAVFLLFV 
GDLNRAEKGARLHTDTFQPQGVEGLLISSVDAALAGQNALLAAESLGYGGVIIGLVRYKSEEVAELFNLPDYTYSVFG 
MALGVPNQHHDMKPRLPLENVVFEEEYQEQSTEAIQAYDRVQADYAGARATTSWSQRLAEQFGQAEPSSTRKNLEQK 
KLLZ

MLKLIAIVGTNSKRSTNRQLLQYMQKHFTDKAEIELVEIKAIPVFNKPADKQVPAEILEIAAKIEEADGVIIGTPEYD
HSIPAVLMSALAWLSYGIYPLLNKPIMITGASYGTLGSSRAQLQLRQILNAPEIKANVLPDEFLLSHSLQAFNPSGDLVDL
DVDCKLDAIFDDFRIFVKITEKLRNAQELLRKDAEDFDWENLZ

MhTITQLNNGVEIPVLGFGTFKAKDGEEAYRAVLEALKAGYRHID 

QQTYEQTRQALEKSIEKLGLDYLDLYLIHWPNPKPLRENDAWKTRNAEVWRAMEDLYQEGKIRAIGVSNFLPHHLDAL 
LETATIVPAVNQVRLAPGVYQDQVVAYCREKGILLEAWGPFGQGELFDSKQVQEIAANHGKSVAQIALAWSLAEGFLP
LPKSVTTSRIQANLDCFGIELSHEERETLKTIAVQSGAPRVDDVDFZ 

MRCKMLDPIAIQLGPLAIRWYALCIVTGULAVYLTMKEAPRKKIIPDDILDFILVAFPLAILGARLYYVIFRFDYYSQNLG 
EIFAIWNGGLAIYGGLITGALVLYIFADRKLINTWDFLDIAAPSVMIAQSLGRWGNFFNQEAYGATVDNLDYLPGFIRDQ 
MYIEGSYRQPTFLYESLWNLLGFALILIFRRKWKSLRRGHITAFYLIWYGFGRMVIEGMRTDSLMFFGFRVSQWLSVVLI
GLGIMrVIYQNRKKAPYYITEEENZ 

MGKLSSILLGTVSGAALALFLTSDKGKQVCSQAQDFLDDLREDPEYAKEQVCEKLTEVKEQATDFVLKTKEQVESGEIT 
VDSILAQTKSYAFQATEASKNQLNNLKEQWQEKAEALDDSEEIVIDITEEZ

MKTKLIFWGSMLFLLSLSILLTIYLAWIFYPMEIQWLNLTNRVYLKPETIQYNFHILMNYLTNPFSQVLQMPDFRSSAAG 
LHHFAVVKNLFHLVQLVALVTLPSFYVFVNRJVKKDFLSLYRKSLLALVVLPVMIGLGGVLIGFDQFFTLFHQILFVGD 
DTWLFDPAKDPVIMILPETFFLHAFLLFFALYENFFGYLYLKSRRKZ 

MTYHFTEEYDIIVIGAGHAGVEASLAASRMGCKVLLATINIEMLAFMPCNPSIGGSAKGIVVREVDALGGEMAKTIDKT
YIQMKMLNTGKGPAVRALRAQADKELYSKEMRKTVENQENLTLRQTMIDEILVEDGKVVGVRTATHQEYAAKAVIVT 
TGTALRGEIIIGDLKYSSGPNHSLASINLADNLKELGLEIGRFKTGTPPRVKASSINYDVTEIQPGDEVPNHFSYTSRDEDY 
VKDQVPCWLTYTNGTSHEIIQNNLHRAPMFTGVVKGVGPRYCPSIEDKIVRFADKERHQLFLEPEGRNTEEVYVQGLST 
SLPEDVQRDLVHSIKGLENAEMMRTGYAIEYDMVLPHQLRATLETKKISGLFTAGQTNGTSGYEEAAGQG1IAGINAAL 

KIQGKPEULKRSDGYIGVMIDDLVTKGTIEPYRLLTSRAEYRLILRHDNADMRLTEMGREIGLVDDERWARFEIKKNQF
DNEMKRLDSIKLKPVKETNAKVEEMGFKPLTDAVTAKEFLRRPEVSYQDVVAFIGPAAEDLDDKIIELIETEIKYEGYISK 
AMDQVAKMKRMEEKRIPANIDWDDIDSIATEARQKFKLINPETIGQASRISGVNPADISILMVYLEGKNRSISKTLQKSKZ 

MTKQVLLVDDEEHILKLLDYHl^KEGFSTQLVTNGRKALALAETEPFDFILLDIMLPQLDGMEVCKRLRAKGVKTPIM 
MVSAKSDEFDKVLALELGADDYLTKPFSPRELLARVKAVLRRTKGEQEGDDSDNIADDSWLFGTLKVYPERHEVYKA
NKLLSLTPKEFESDKNPFFEVFKVSKVTAQZ 

MTTFKDGFLWGGAVAAHQLEGGWQEGGKGISVADVMTAGRHGVAREITLGVLEGKYYPNHEAIDFYHRYKEDIALF 
AEMGFKCFRTSIAWTR1FPKGDELEPNEEGLQFYDNLFDECLKNGIEPVITLSHFEMPYHLVTEYGGWKNRKUDFFARF 
AEVVFKRYKDKVKYWMTFNEINNQANYQEDFAPFTNSGIVYEEGDNREAIMYQAAHYELVASARAVKIGHEINPDFQI
GCMUMCPIYPVTCNPKDILMAMKAMQKRYYFADVHVLGKYPEHIFKYWERKGISVDFTAQDKEDLLGGTVDYIGFS
YYMSFAIDSHRENNPYFDYLETEDLVKNNYVKASEWEWQIDPEGLRYALNWFTDHYHLPLFIVENGFGAIDQVAADG 
MVHDDYRIEYLGAHIREMKKAVVEDGVDLMGYTPWGCIDLVSAGTGEMRKRYGFIYVDKDDNGKGSYNRSPKKSFG 
WYKEVISSNGESVEZ 



MDQQNGLFGFLENHVMGPMGKLAQFKVVRAITAAGMAAVPFTIVGSMFLVFSILPQAFSFWPIVADIFSASFDKFTSLY 
MVANYATMGSULYFVI^LAYELTKIYAEEEELNMNPLNGALLALMAFVNn*VPQIIFDGGMMKTVTSLKEGAVIADG 
WAMGNWARFGTTGIFTAnMAIVTVLr^RMCVKHNWVIKMPEAVPEGVSRGFTALVPGFVVAFV 
DIFKVLAIPFGFVSNLTNSWIGLMIIYLLTQLLWIVGIHGANIVFAFVSPIALANMAENAAGGHFAVAGEFSNMFVIAGGS
GATLGLCLYIAFASKSEQLKAIGRASVVPALFNINEPLIFGLPIIYNPALAIPFILAPMVTATIYYVANSLNFIKPIIAQVPWP 
TPVGIGAFLGTADLRAVLVALVCAFAAFLVYLPFIRVYDQKLVKEEQGIZ 

MKKFYVSPIFPILVGLIAFGVI^FIIFVNNNLLTVLILFLFVGGYVFLFKKLRVHYTRSDVEQIQYVNHQAEESLTALLEQ 
MPVGVMKLNLSSGEVEWFNPYAELILTKEDGDFDLEAVQTIIKASVGNPSTYAKLGEKRYAVHMDASSGVLYFVDVSR
EQAH'DELVTSRPVIGIVSVDNYDDLEDETSESDISQINSFVANFISEFSEKHMMFSRRVSMDRFYLFTDYTVLEGLMNDK 
FSVIDAFREESKQRQLPLTLSMGFSYGDGNHDEIGKVALLNLNLAEVRGGDQVVVKENDETKNPVYFGGGSAASKRT 
RTRTRAMMTAISDKIRSVDQVFVVGHKNLDMDALGSAVGMQLFASNVIENSYALYDEEQMSPDIERAVSFIEKEGVTK 
LLSVKDAMGMVTNRSLLILVDHSKTALTLSKEFYDLFTQTIVIDHHRRDQDFPDNAVITYIESGASSASELVTELIQFQNS 
KKNRI^RMQASVLMAGMMLDTKNFTSRVTSRTFDVASYLRTRGSDSIAIQEIAATDFEEYREVNELILQGRKLGSDVLI
AEAKDMKCYDTVVISKAADAMLAMSGIEASFVLAKNTQGFISISARSRSKLNVQRIMEELGGGGHFNLAAAQIKDVTLS 
EAGEKLTEIVLNEMKEKEKEEZ 

MKEKNMWKELLNRAGWILVFLLAVLLYQVPLVVTSILTLKEVALLQSGLIVAGLSIVVLALFIMGARKTKLASFNFSFF 
RAKDLARLGl^YLVIVGSNILGSILLQL^NETTTANQSQINDMVQNSSLISSFFLLALLAPICEEILCRGIVPKKIFRGKENL
GFVVGTIVFALLHQPSNLPSLLIYGGMSTVLSWTAYKTQRLEMSILLHMIVNGIAFCLLALVVfMSRTLGISVZ 

MKEKNMWKELLNRAGWILVFLLAVLLYQVPLVVTSILTLKEVALLQSGLIVAGLSIVVLALFIMGARKTKLASFNFSFF 
RAKDLARLGLSYLVIVGSNILGSILLQl^NETTTANQSQINDMVQNSSLISSFFLLALLAPICEEILCRGIVPKKIFRGKENL 
GFVVGTIVFALLHQPSNLPSLLIYGGMSTVLSWTAYKTQRLEMSILLHMIVNGIAFCLLALVVIMSRTLGISVZ

MDTQKIEAAVKMIIEAVGEDANREGLQETPARVARMYQEIFSGLGQTAEEHLSKSFEIIDDNMVVEKDIFFHTMCEHHF 
LPFYGRAHIAYIPDGRVAGLSKLARTVEVYSKKPQIQERLNIEVADALMDYLGAKGAFVVIEAEHMCMSMRGVRKPGT 
ATLTTVARGLFETDKDLRDQAYRLMGLZ

MKDLFLKRKQAFRKECLGYLRYVLNDHFVLFLLVLLGFLAYQYSQLLQH
FPENHWPILLFVGITSVLLLLWGGTATYMEAPDKLFLLVGEEEIKLHLKRQTGISLVFWLFVQTLFLLLFAPLFLAMGY
GLPVFLLYVLLLGVGKYFHFCQKASKFFTETGLDWDYVISQESKRKQVLLRFFALFTQVKGISNSVKRRAYLDFILKAV 
QKVPGKIWQNLYLRSYLRNGDLFALSLRLLLLSLLAQVFIEQAWIATAVVVLFNYLLLFQLLALYHAFDYQYLTQLFPL 
DKGQKEKGLQEVVRGLTSFVLLVELVVGLITFQEKLALLALLGAGLVLLVLYLPYQVKRQMQDZ 

MRKSIVUWVDNAYLIPLETTIKSVLYHNRDVDTOLNSDIAPEWFKLLGRKMEVVNSTIRSVHIDKELFESYCT

SYFRITATEVVESDRVLYLDSDIIVTGELATLFEIDLKGYSIGAVDDVYAYEGRKSGFNTGMLLMDVAKWKEHSIVNSL 
LELAAEQNQVVHLGDQSILNIYFEDNWLALDKTYNYMVGIDIYHLAQECERLDDNPPTIVHYASHDKPWNTYSISRLRE 
LWWVYRDLDWSEIAFQRSDLNYFERSNQSKKQVMLVTWSADIKHLEYLVQRLPDWHFHLAAPCDCSEELTSLSQYTN 
VTVYQNVLHSRIDWLLDDSIVYLDIhrTGGEVFNVVTRAQESGKKIFAFDrTRKSMDDGLYDGIFSVERPDDLVDRMKNI 

EIEZ

MTKIYSSIAVKKGLFTSFLLFIYVLGSRIILPFVDLNTKDFLGGSTAYLAFSAALTGGNLRSLSIFSVGLSPWMSAMILWQ 
MFSFSKRLGLTSTSIEIQDRRKMYLTLLIAVIQSLAVSLRLPVQSSYSAILVVLMNTILLIAGTFFLVWLSDLNASMGIGGSI 
VILLSSMVLNIPQDVLETFQTVHIPTGIIVLLALLTLVFSYLLALMYRARYLVPVNKIGLHNRFKRYSYLEIMLNPAGGMP 
YMYVMSFLSVPAYLFILLGFIFPNHSGLAALSKEFMVGKPLWVYVYISVLFLFSIIFAFVTMNGEEIADRMKKSGEYIYGI
YPGADTSRFINRLVLRFSVIGGLFWIMAGGPMLFVLFDEKLLRLAMIPGLFMMFGGMIFTIRDEVKALRLNETYRPLIZ 

MSSI^DQELVAKTVEFRQRI^EGESLDDILVEAFAVVREADKRILGMFPYDVQVMGAIVMHYGNVAEMNTGEGKTLT 
ATMPVYLNAFSGEGVMVVTPNEYLSKRDAEEMGQVYRFLGLTIGVPFTEDPKKEMKAEEKKLIYASDIIYTTNSNLGFD 

YLNDNLASNEEGKFLRPFNYVIIDEIDDILLDSAQTPLIIAGSPRVQSNYYAIIDTLVTTLVEGEDYIFKEEKEEVWLTTKG
AKSAENFLGIDNLYKEEHASFARHLVYA1RAHKLFTKDKDYURGNEMVLVDKGTGRLMEMTKLQGGLHQAIEAKEHV 
KLSPETRAMASrrYQSLFKMFNKISGMTGTGKVAEKEnETYNMSVVRIPTNRPRQRIDYPDNLYITLPEKVYASLEYlKQ 
YHAKGNPLLVFVGSVEMSQLYSSLLFREGIAHNVLNANNAAREAQIISESGQMGAVTVATSMAGRGTDIKLGKGVAEL 
GGLrVIGTERMESQRIDLQIRGRSGRQGDPGMSKFFVSLEDDVIKKFGPSWVHKKYKDYQVQDMTQPEVLKGRKYRKL 

VEKAQHASDSAGRSARRQTLEYAESMNIQRDIVYKERNRLIDGSRDLEDVVVDIIERYTEEVAADHYASRELLFHFIVTN
ISFHVKEVPDYIDVTDKTAVRSFMKQVIDKELSEKKELLNQHDLYEQFLRLSLLKAIDDNWVEQVDYLQQLSMAIGGQS 
ASQKNPIVEYYQEAYAGFEAMKEQIHADMVRNLLMGLVEVTPKGEIVTHFPZ 

MIGTFAAALVAVLANFIVPIEITPNSANTEIAPPDGIGQVLSNLLLKLVDNPVNALLTANYIRJLSWAVIFGIAMREASKNS 
QELLKTLADVTSKIVEWIINLAPFGILGLVFKTISDKGVGSLANYGILLVLLVTTMLFVAPVVNPLIAFFFMRRNPYPLVW
TGYGTDYACKEWADAYFPKLLEGGQLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDSTHFTTY
GKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEEADSFITPVLERFNQLLFRMDSGFATPKLYDLIE 
KTGQYYLKUCKr^TTVLSRLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNLFYDVISLVTNMTS 
GTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTLIKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTKRFRHLFL 
HVVGKCVRTGRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMHZ

MMEFFQQLPHLEPYGNPQYFVYVIAATLP1FIGLFFKKRFAWYEVLVSLFFIVTMLVGGKTNQLAALGIYLCWEILLLLF 
YKHYRKSKDGKWVFYLVSFLSLLPIIFVKVQPAINGTQSLLGFLGISYLTFRSVGIVIELRDGVIKDFTLWEFLRFLLFMPT 
FSSGPIDRFKRFNENYQAIPERDELMDMLDESVRYIMWGFLYKFILAHVLGETLLPPLKNLALQSGGFFNLYALAVMYT 
FGLELFFDFAGYSMFALAISNLMGIRSPlNFNKPFLSRDLKEFWNRWHMSLSFWFRDFVFMRMVMVLTRiCKVFKNRN
VTSSMAYIVNMLIMGFWHGVTWYYIAYGLFHGLGLVINDAWVRKKKTLNKERKKAGKAALPENRWIQLLGMVVTFH 
VVMLSFLIFSGFLNNLWFKKZ 

MLKRLWMIFGPVLUGLLVFLLIFFYPTEMHHNLGAEKRSAVATTIDSFKERSQKVRALSDPNVRFVPFFGSSEWLRFD 
GAHPAVLAEKYNRSYRPYLLGQGGAASLNQYFGMQQMLPQLENKQVVYVISPQWFSKNGYDPAAFQQYFNGDQLTS
FLKHQSGDQASQYAATRLLQQFPNVAMKDLVQKLASKEELSTADNEMIELLARFNERQASFFGQFSVRGYVNYDKHV 
AKYLKILPDQFSYQAIEDVVKADAEKm^NNEMGMENYFYNEQIKKDLKKLKDSQKSFTYLKSPEYNDLQLVLTQFSK 
SKVNPIFIIPPVNKKWMNYAGLREDMYQQTVQKIRYQLESQGFTNIADFSKDGGEPFFMKDTIHLGWLGWLAFDKAVD 
PFLSNPTPAPTYHLNERFFSKDWATYDGDVKEFQZ 


MEKNLKALKQTTDQEGPAIEPEKAEDTKTVQNGYFEDAAVKDRTLSDYAGNWQSVYPFLEDGTFDQVFDYKAKLTG 
KMTQAEYKAYYTKGYHTDVTKINITDNTMEFVQGGQSKKYTYKYVGKKILTYKKGNRGVRFLFEATDADAGQFKYV 
QFSDHNVAPVKAEHFHIFFGGTSQEALFEEMDNWPTYYPDNLSGQEIAQEMLAHZ 

MKDGHLl^HHIRLLNGRIFQKLl^QDPEALYRGEQGKILAVLWNSETGCATATDIALATGLAN^LTTMIKKLEEQKL
VIVSPCGKDKRKKYLVLTELGKSQKEVGHRVSQKLDTIFYKGFSEEEIHQFEGFQERILANLKEKGNEVZ 

MTNLIATFQDRFSDWLTALSQHLQLSLLTLLLAILLAIPLAVFLRYHEKLADWVLQIAGIFQTIPSLALLGLFIPLMGIGTL 
PALTALVIYAIFPILQNTrTGLKGIDPNLQEAGlAFGMTRWERLKKFEIPLAMPVIMSGIRTAAVLHGTATLAALIGAGGL 

GSFILLGIDRNNASLILIGALSSAVLAIAFNFLLKVMEKAKLRTIFSGFALVALLLGLSYSPALLVQKEKENLVIAGKIGPEP
EILANMYKLLIEE^MTATVKPNFGKTSFLYEALKKGDIDIYPEFTGTVTESLLQPSPKVSHEPEQVYQVARDGIAKQD 
HLAYLKPMSYQNTYAVAVPKKIAQEYGLKTISDLKKVEGQLKAGFaEFNDREDGr^KGLQSMYGLNLNVATIEPALRY 
QAIQSGDIQrTDAYSTDAELERYDLQVLEDDKQLFPPYQGAPLMKEALLKKHPELERVLNTLAGKITESQMSQLNYQVG 

VEGKSAKQVAKEFLQEQGLLKKZ

MMHTYLQKKIENIKTTLGEMSGGYRRMVAAMADLGFSGTMKAIWDDLFAHRSFAQWIYLLVLGSFPLWLELVYEHRi 
VDWIGMICSLTGIICVIFVSEGRASNYLFGLINSVIYLILALQKGFYGEVLTTLYFTVMQPIGLLVWIYQAQFKKEKQEFV 
ARKLDGKGWTKYLSISVLWWLAFGFIYQSIGANRPYRDSITDATNGVGQILMTAVYREQWIFWAATNVFSIYLWWGES 
LQIQGKYLIYLINSLVGWYQWSKAAKQNTDLLNZ 


MRNMKAKYAVWVAFFLNLTYAIVEFIAGGVFGSSAVLADSVHDLGDAIAIGISAFLETISNREEDNQYTLGYKRFSLLG 
ALVTAVILVTGSVLVILENVTKILHPQPVNDEGILWLGIIAITINLLASLVVGKGKTKNESILSLHFLEDTLGWVAVILMAI 
VLRFTDWYILDPLLSLVISFFILSKALPRFWSTLKIFLDAVPEGLDIKQVKSGLERLDNVASLNQLNLWTMDALEKNAIV 
HVCLKEMEHMETCKESIRIFLKDCGFQNITIEIDADLETHQTHKRKVCDLERSYEHQHZ 


MIEYKNVALRYTEKDVLRDVNLQIEDGEFMVLVGPSGSGKTTMLKMINRLLEPTDGNIYMDGKR1KDYDERELRLSTG 
YVLQAlAtFPNLTVAENlALIPEMKGWSKEEITKICTEELLAKVGLPVAEYGHRLPSELSGGEQQRVGIVRAMIGQPKIFL 
MDEPFSALDAISRKQLQVLTKELHKEFGMTTIFVTHDTDEALKLADRIAVLQDGEIRQVANPETILKAPATDFVADLFG 
GSVHDZ 


MSAVAISAMTKVMQETHGNPSSIHGHGRQAGKLLREARQELAQLLRTKPQHIFFTSGGTEGNNTTIIGYCLRHQEQGKH 
IITrAIEHHAVLETIDYLVQHFGFEATIIQPENQEITAQQIQKALRDDTILVSTMFVNNETGNLLPIAEIGQILKQHPAAYH 
VDAVQAIGKIPIHSEELGIDFLTASAHKFHGPKGIGFLYASSMDFDSYLHGGDQEQKKRAGTENLPAIVGMVAALKEDL 
EKQEEHFQHVQNLETAFLAELEGIQYYLNRGKHHLPYVLNIGFPGQKNDLLLLRLDLAGISISTGSACTAGVVQSSHVLE 
AMYGANSERLKESLRISLSPQNTVEDLQTLAKTLKEIIGGZ



MLFKLSKEKIELGLSRLSPARR1FLSFALVILLGSLLLSLPFVQVESSRATYFDHLFTAVSAVCVTGLSTLPVAHTYNIWG 
QIICLLLIQIGGLGLMTFIGVFYIQSKQKLSLRSRATIQDSFSYGETRSLRJCFVYSIFLTTFLVESLGAILLSFRLIPQLGWGR 
GLFSSIFLAISAFCNAGFDNLGSTSLFAFQTDLLVNLVIAGLIITGGLGFMVWFDLAGHVGRKKKGRLHFHTKLVLLLTI
GLLLFGTATTLFLEWNNAGTIGNLPVADKVLVSFFQTVTMRTAGFSTIDYTQAHPVTLLIYILQMFLGGAPGGTAGGLK 
ITTFFVLLVFARSELLGLPHANVARRTIAPRTVQKSFSVFIIFLMSFLIGLILLGITAKGNPPFIHLVFETISALSTVGVTANL 
TPDLGKLALSVIMPLMFMGRIGPLTLFVSLADYHPEKKDMIHYMKADISIGZ 

ILILVGSLLVSPWSVYDIAEFFSRGFAKWWEGHERRKEERFVKQEEKARQKAEKEARLEQEETEKALLDLPPVDMETGE

ILTEEAVQNLPPIPEEKWVEPEIILPQAELKFPEQEDDSDDEDVQVDFSAKEALEYKLPSLQLFAPDKPKDQSKEKKIVRE 

NIKILEATFASFGIKVTVERAEIGPSVTKYEVKPAVGVRVNRISNLSDDLALALAAKDVRIEAPIPGKSLIGIEVPNSDIATV 

SFRELWEQSQTKAENFLEIPLGKAVNGTARAFDLSKMPHLLVAGSTGSGKSVAVNGIIASILMKARPDQVKFMMVDPK 

MVELSVYNDIPHLLlPVVTNPRKASKALQKVVDEMENRYELFAKVGVRNIAGFNAKVEEFNSQSEYKQiPLPFIVVIVDE 

LADLMMVASKEVEDAIIRLGQKARAAGIHMILATQRPSVDVISGLIKANVPSRVAFAVSSGTDSRTILDENGAEKLLGRG 

DMLFKPIDENHPVRLQGSFISDDDVERIVNFIKTQADADYDESFDPGEVSENEGEFSDGDAGGDPLFEEAKSLVIETQKA 

SASMIQRRLSVGFNRATRLMEELEIAGVIGPAEGTKPRKVLQQZ 



MSYFKKYKFDKSQFKLGMRTFKTGIAVFLVLLIFGFFGWKGLQIGALTAVFSLRESFDESVHFGTSRILGNSIGGLYALV 
FFLLNTFFHEAFWVTLVVVPICTMLTIMTNVAMNNKAGVIGGVAAMLIITLSIPSGETILYVFVRVLETFMGVFVAIIVN 
YDIDRIRLFLEKKEKZ 



MNKSEHRHQLIRALITKNKIHTQAELQALLAENDIQVTQATLSRDIKNMNLSKVREEDSAYYVLNNGSISKWEKRLELY 
MEDALVWMRPVQHQVLLKTLPGLAQSFGSIIDTLSFPDAIATLCGNDVCLIICEDADTAQKCFEELKKFAPPFFFEEZ 



MKSIKLNALSYMGIRVLNIIFPILTGTYVARVLDRTDYGYFNSVDTILSFFLPFATYGVYNYGLRAISNVKDNKKDLNRT 

FSSLFYLCIACTILTTAVYILAYPLFFTDNPIVKKVYLVMGIQLIAQIFSIEWVNEALENYSFLFYKTAFIRILMLVSIFLFVK 

NEHDIVVYTLVMSLSTLIOTLISYFWIKRDIKLVKIHLSDFKPLFLPLTAMLVFANANMLFTFLDRLFLVKTGIDVNVSY 

YTIAQRIVTVIAGWTGAIGVSVPRLSYYLGKGDKEAYVSLVNRGSRIFNFFIIPLSFGLMVLGPNAILLYGSEKYIGGGIL 

TSLFAFRTnLALDTILGSQILFTNGYEKRITVYTVFAGLLNLGLNSLLFFNHIVAPEYYLLTTMLSETSLLVFYIIFIHRKQL 

IHLGHIFSYTVRYSLFSLSFVAIYFLINFVYPVDMVINLPFLINTGUVLLSAISYISLLVFTKDSIFYEFLNHVLALKNKFKK 

SZ 



MELFMKITNYEIYKLKKSGLTNQQILKVLEYGENVDQELLLGDIADISGCRNPAVFMERYFQIDDAHLSKEFQKFPSFSIL 
DDCYPWDLSEIYDAPVLLFYKGNLDLLKFPKVAVVGSRACSKQGAKSVEKVIQGLENELVIVSGLAKGIDTAAHMAAL 
QNGGKTlAVIGTGLDVFYPKANKRLQDYIGNDHLVLSEYGPGEQPLKFHFPARNRIIAGLCRGVrVAEAKMRSGSLITCE 
RAMEEGRDVFAIPGSILDGLSDGCHHLIQEGAKLVTSGQDVLAEFEFZ 



MKQLTVEDAKQIELEILDYIDTLCKKHNINYIINYGTLIGAVRHEGFIPWDDDIDLSMPREDYQRFINIFQKEKSKYKLLS 
LETDKNYFNNFIKITDSTTKIIDTRNTKTYESGIFIDIFPIDRFDDPKVIDTCYKLESFKLLSFSKHKNIVYKDSLLKDWIRT 
AFWLLLRPVSPRYFANKIEKEIQKYSRENGQYMAFIPSKFKEKEVFPSGTFDKTIDLPFENLSLPAPEKFDTILTQFYGDY 
MTLPPEEKRFYSHEFHAYKLEDZ 



MIKINHLTrTQNKDLRDLVSDLTMTIQDGEKVAIIGEEGNGKSTLLKILMGEALSDFTIKGNIQSDYQSLAYIPQKVPEDL 
KKKTLHDYFFLDSIDLDYSILYRLAEELHFDSNRFASDQE[GNLSGGEALKIQLIHELAKPFEILFLDEPSNDLDLETVDW 
LKGQIQKTRQTVIFISHDEDFLSETADTIVHLRLVKHRKEAETLVEHLDYDSYSEQRKANFAKQSQQAANNQRAYDKT 
MEKHRRVKQNVETALRATKDSTAGRLLAKKMKTVl^QEKRYEKAAQSMTQKPLEEEQIQLFFSDIQPLPASKVLVQLE 
KENLSIDDRVLVQKLQLTVRGQEKIGIIGPNGVGKSTLLAKLQRLLNDKREISLGFMPQDYHKKLQLDLSPIAYLSKTGE 
KEELQKIQSHLASLNFSYPEMQHQIRSLSGGQQGKLLLLDLVLRKPNFLLLDEPTRNFSPTSQPQIRKLFATYPGGLITVS 
HDRRFLKEVCSIIYRMTEHGLKLVNLEDLZ 



MKPKTFYNLLAEQNLPLSDQQKEQFERYFELLVEWNEKINLTAITDKEEVYLKHFYDSIAPILQGLIPNETIKLLDIGAGA 
GFPSLPMKILYPELDVTIIDSLNKR1NFLQLLAQELDLNGVHFYHGRAEDFAQDKNFRAQYDFVTARAVARMQVLSELT 
IPYLKVGGKLLALKASNAPEELLEAKNALNLLFSKVEDNLSYALPNRDPRYITVVEKKKETPNKYPRKAGMPNKRPLZ 



MSIKUAVDIDGTLVNSQKEITPEVFSAIQDAKEAGVKVV1ATGRPIAGVAKLLDDLQLR0EGDYVVTFNGALVQETATG 
HEIISESLTYEDYLDMEFLSRKLGVHMHAITKDGIYTANRNIGKYTVHESTLVSMPIFYRTPEEMAGKEIVKCMFIDEPEI 
LDAA1EKIPAEFYERYSINKSAPFYLELLKKNVDKGSAITHLAEKLGLTKDETMAIGDEENDRAMLEVVGNPVVMENGN 
PEIKKIAKYITKTNDESGVAHAIRTWVLZ 



MTWIILGVIALIVIFVIVSYNGLVKNRMQTKEAWSQIDVQLICRRNDLLPNLIETVKGYAKYEGSTLEKVAELRNQVAAA 
TSPAEAMKASDALTRQVSGIFAVAESYPDLKASANFVKLQEELTNTENKISYSRQLYNSVVSNYNVKLETFPSNUAGMF 
GFKAADFLQTPEEEKSVPKVDFSGLGDZ
NCLRVSGVTAFlHTtfSATNIPVNMKLCHDLGLNPDTO

VAAISSCDASGIAGGSLLUPVACSLFGISND1AIQIVGVGFVIGVIQDSCETALNSSTDVLFTAVAEYAATRKKZ 



MSISQRTTKULATCIJVCLLAYFLNIJSSAVSAGIIAL^

LAFYVPUVYKMGWEIGrTPSTVLVSHLLVQESTSPDIXVNEFLLFAIGTGFALLVNLYMPSREEEIQHYHTLVEEKLKDI 
LQRFKYYl^RGIXjRNRAQLVAELDTLlXEALRLVYLDHSDHLFHQTDYHIH^ 

ESLILAQLFSKIAGQLSQTNPASDLLDEIERYLEVFRNRSLPKTREEFETRATLLQLLREAKTFIQVKVDFYQKYRQZ 

MEIMSl^IAVFAVIIGLVIGYVSISAKMKSSQEAAELMLLNAEQEATNLRGQAEREADLLVNEAKRESKSLKKEALLEAK
EEARKYREEVDAEFKSERQELKQIESRLTERATSLDRKDDNLTSKEQTLEQKEQSISDRAKNLDAREEQLEEVERQKEAE 
LERIGALSQAEARDIIlj\QTEENLTREIASRIREAEQEVKERSDKMAKDILVQAMQRUGEYVAESTNSTVHLPDDTMKG 
WIGREGRNmTFESLTGVDVIIDDTPEVVTl^GFDPIRREIARMTMEMLLKDGRIHPARIEELVEKNRQEIDNKIREYGEA 
AAYEIGAPNLHPDU4KIMGRLQFRTSYGQNVLRHSIEVAKLAGIMASELGENAALARRAGFLHDIGKAIDHEVEGSHVE 

IGMELARKYKEPPVVV^IASHHGDVEAESVIAVIVAAADAI^AARPGARSESLESYKRLHDLEEIANGFEGVQTSFAL
QAGREIRIMVNPGKKDDKVTILAHKVRKKIENNLDYPGNIKVTVIRELRAVDYAKZ 

MMLKPSIDTLLDKVPSKYSLVILEAKRAHELEAGAPATQGFKSEKSTLRALEEIESGNVTIHPDPEGKREAVRRRIEEEKR 
RKEEEEKKKEQIAKEKEDGEKIZ

MSAYQLPTVWQDEASNQGAFTGLNRPTAGARFEQNLPKGEQAFQLYSLGTPNGVKVTILLEELLEAGFKEAAYDLYKI 
AIMDGDQFGSDFVKLNPNSKIPALLDQSGTENVRVFESAHILLYLAEKFGAFLPSNPVEKVEVLNWLFWQAGAAPFLG 
GGFGHFFNYAPEKLEYPINRFTMEVKRQLDLLDKELAQKPYIAGNDYTIADIAIWSWYGQLVQGNLYQGSAKFLDASS 
YQNLVKWAEKIANRPAVKRGLEVTYTEIKZ

LASLrTSIIMFYVGFDVLRDTIQKIl^REETVIDPLGA^ 

GTAIAILASSFNYPIVDKLVAIHTFFILKTAYDIFIESSFSLSDGFDDRLLEDYQKAIMEIPKISKVKSQRGRTYGSNIYLDIT 
LEMNPDLSVFESHEIADQVESMLEERFGVFDTDVHIEPAPIPEDEILDNVYKKLLMREQLIDQGNQLEELLTDDFVYIRQ 
DGEQMDKEAYKTKKELNSAIKDIQrTSISQKTKLICYELDGIIHTSIWRRHETWQNIFHQETKKEZ 


MTIKLVATDMDGTFLDGNGRFDMDRLKSLLVSYKEKGIYFAVASGRGFLSLEKLFAGVRDDUFIAENGSLVEYQGQDL 
YEATMSRDFYLATFEKLICTSPYVDINKLLLTGKKGSYVLDTVDETYLKVSQHYNENIQKVASLEDrrDDIFKFTTNFTEE 
TLEDGEAWVNENVPGVKAMTTGFESIDIVLDYVDKGVAIVELVKKLGITMDQVMAFGDNLNDLHMMQVVGHPVAPE 
NARPEILELAKTVIGHHKERSVIAYMEGLZ 


MADIKLIALDLDGTLLTTDKRLTDRTKETLQAARDRGIKVVLTTGRPLKAMDFFLHELGTDGQEDEYTITFNGGLVQK 
NTGEILDKTVFSYDDVARLYEETEKLSLPLDAISEGTVYQIQSDQESLYAKFNPALTFVPVDFEDLSSQMTYNKCVTAFA 
QEPLDAAIQKISPELFDQYEIFKSREMLLEWSPKNVHKATGLAKLISHLGIDQSQVMACGDEANDLSMIEWAGLGVAM 
QNAVPEVKAAANVVTPMTNDEEAVAWAIEEYVLKENZ 

MESLLILLLIANLAGLFLIWQRQDRQEKHLSKSLEDQADHLSDQLDYRFDQARQASQLDQKDLEVVVSDRLQEVRIELH 
QGLTQVRQEMTDNLLQTRDKTDQRLQALQESNEQRLEQMRQTVEEKLEKTLQTRLQASFETVSKQLESVNRGLGEMQ 
TVARDVGALNKVl^GTKTRGILGELQLGQIIEDIMTPAQYEREYATVENSSERVEYAIKLPGQGDQEYVYLPIDSKFPLA 
DYYRLEEAYETGDKDEIERCRKSLLASVKRFARDIRNKYIAPPRTTNFGVLFVPTEGLYSEIVRNPVFFDDLRREEQIIVA 
GPSTLSALLNSLSVGFKTLNIQKSADHISKTLASVKTEFGKFGGILVKAQKHLQHASGNIDELLNRRTIAIERTLRHIELSE
GEPALDLLHFQENEEEYEDZ 

MKISHMKKDELFEGFYLIKSADLRQTRAGKNYLAFTFQDDSGEIDGKLWDAQPHNIEAFTAGKVVHMKGRREVYNNT 
PQVNQITLRLPQAGEPNDPADFKVKSPVDVKEIRDYMSQMIFKIENPVWQRIVRNLYTKYDKEFYSYPAAKTNHHAFET 
GLAYHTATMVRl^DAlSEVYPQLhOCSLLYAGIMLHDLAKVIELTGPDQTEYTVRGNLLGHIALIDSEITKTVMELGIDDT
KEEVVLLRHVll^HHGLLEYGSPVRPRiMEAEIIHMIDNLDASMMMMSTALALVDKGEMTNKIFAMDNRSFYKPDLDZ 

MSEKAKKGFKMPSSYTVLLIlIAIMAVLTWnPAGAFIEGIYETQPQNP(^IWDVLMAPIRAMLGTHPEEGSLIKETSAAID 
VAFFILMVGGFLGIVNKTGALDVG1ASIVKKYKGREKMLILVLMPLFALGGTTYGMGEETMAFYPLLVPVMMAVGFDS 
LTGVAIILLGSQIGCLASTLNPFATGIASATAGVGTGDGIVLRLIFWVTLTALSTWFVYRYADKIQKDPTKSLVYSTRKED
LKHFNVEESSSVESTl^SKQKSVLFLFVLTnLMVLSFIPWTDLGVTIFDDFNTWLTGLPVIGNIVGSSTSALGTWYFPEG 
AMLFAFMGILIGVIYGLKEDKnSSFMNGAADLLSVALIVAIARGIQVIMNDGMITDTILNWGKEGLSGLSSQVFIVVTYIF 
YLPMSFLIPSSSGLASATMGIMAPLGEFVNVRPSLIITAYQSASGVLNLIAPTSGIVMGALALGRINIGTWWKFMGKLVVA 
IIVVTIALLLLGTFLPFLZ 


MSNSFVKLLVSQLFANLADIFFRVTIIANIYIISKSVIATSLVPILIGISSFVASLLVPLVTKRLALNRVLSLSQFGKTILLAIL 
VGMFTVMQSVAPLVTYLFVVAISILDGFAAPVSYAIVPRYATDLGKANSALSMTGEAVQLIGWGLGGLLFATIGLLPTT 
CINLVLYIISSFLMLFLPNAEVEVLESETNLEILLKGWKLVARNPRLRLFVSANLLEIFSNTIWVSSIILVFVTELLNKTESY 
WGYSNTAYSIGIIISGLIAFRLSEKFLAAKWEPQLFTPNLKTIQNPCLSLDPGWFLFSPNGCFLLDKKEFPLYGISVEKNTK 
RKETHMNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKERISKYLVTNDQRRYCRYSDSDILVQFLFQLL
MLFDQIASNKRKTWILLLVFFLLLALVGYAVGYLFIRSGLGGLVIALIIGFIYALSMIFQSTEIVMSMNGAREVDEQTAPD
LYHVVEDMALVAQIPMPRVFIIDDPALNAFATGSNPQNAAVAATSGLLAIMNREELEAVMGHEVSHIRNYDIRISTIAV 
ALASAITMI^SMAGRMMWWGGAGRRRSDDDRDGNGLEIIMLVVSLLAIVLAPLAATLVQLAISRQREFLADASSVELT 
RNPQGMINALDKLDNSKPMSRHVDDASSALYINDPKKGGGFQKLFYTHPPISERIERLKQMZ 



MKLNIQEIRKQSEGLNFEQTLDLVDDLRARNQEILDVKDILAVGKVQYEDRMYFLDYQLSYTIVLASSRSMEPVELVES 
YPVTEVFMEGATNQLDQEVLDDDLVLPIENGELDLAESVSDNILLNIPIKVLTAEEEAGQGFISGNDWQIMTEEEYQAQ 
KAVKKEENSPFAGLQGLFDGDEZ 

MKRQLALVVFSGGQDSTTCLFWVMQHYETVEAVTFAYGQRHHLEIQITREIAKEQGIRHHILDMSLLCQITAQPDFATI 
HISYIPDKLCVESKSLKLYLFSYRNHGDFHENCINTIGKDLVNLLDPRYLEVWGKFTPRGGISIDPYYNYGKQGTKYEGL 
AEQRLFQHDLYPEKIDNRZ 



MTETVEDKVSHSITGLDILKGIVAAGAVISGTVATQTKVFTNESAVLEKTVEKTDALATNDTVVLGTISTSNSASSTSLSA 
SESASTSASESAST^ASTSASTSASESASTSASTSISASSTVVGSQTAAATEATAKKVEEDRKKPASDYVASVTNVNLQSYA 
KRRKRSVDSIEQLLASIKNAAVFSGNTIVNGAPAINASLNIAKSETKVYTGEGVDSVYRVPIYYKLKVTNDGSKLTFTYT 

VTYVNPKTNDLGNISSMRPGYSIYNSGTSTQTMLTLGSDLGKPSGVKNYITDKNGRQVLSYNTSTMTTQGSGYTWGNG
AQMNGFFAKKGYGLTSSWTVprTGTDTSFTFTPYAARTDRIGINYFNGGGKVVESSTTSQSLSQSKSLSVSASQSASASAS 
TSASASASTSASASASTSASASASTSASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASE 
SASTSASASASTSASESASTSASASASTSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTSASESASTST 
SASASTSASESASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSASASASTS 

ASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASESASTSASASASTSASESASTSASASA
STSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTSASESASTSTSASASTSASESASTSASASASTSASA 
SASTSASASARQVRRPQPVHLNRHQPVRQPQQVLVHQLQHQRVHRLQHQPVPRLQRQPVRQLQQVPVLQSQHQQVLQ 
PQHRQVPRLQQAHQHLNQRRQAPQLQQVPVRQPQRRQVRQPQQVLVHQLQHQRVHRLRRQPVHQSQQVPVRQLPHQ 
QVPRLQQAPVRRLQQVLAPQPQPQPVRQPQQVSQRLNRHQRVRPLQQVLAPQPQRQQVHRLQRQRVRLNRHQRVRPL 

QQVLAPQPQRQQVHRLQHQRVRPLQQVLAPQPQRQQVHRLQRQRVRLSQHQRVRQPQQAHQLLNLHQPVRQPQHRQ
APQLQQVPVRQPQRRQVRRLQQVPVRQPQQVPVRQPQRRQVRRPQPVHLNRHQPVRQPQQVLVHQLQHQRVHRLQH 
QPVHQSQQVPVRQFRINKCLGFSKYZ 



MGVETWFYSSICWLAIGLGSVWKFPYMTAANGGGGFLLIFLISTILIGFPLLLAEFALGRSAGVSAIKTFGKLGKNNKYN
FIGWIGAFALHLLSFYSVIGGWILVYLGIEFGKLFQLGGTGDYAQLFTSIISNPAIALGAQAAFILLNIFIVSRGVQKGIERA 
SKVMMPLLFIVFVFIIGRSLSLPNAMEGVLYFLKPDFSKLTSTGLLYALGQSFFALSLGVTVMLTYASYLDKKTNLVQSG 
ISlVAMNISISIMAGLAIFQARSPFNIQSEGGPSLLFIVLPQLFDKMPFGTIFYVLFLLLFLFATVTFSVVMLEINVDNrTNQD 
NSKRAKWSVILGILTFVFGIPSALSYGVMADVHIFGKTFFDAMDFLVSNLLMPFGALYLSLFTGYIFKKALAMEELHLD 

ERAWKQGLFQVWLFLLRFFVSSFQSSSLWSSLPNLCNQKGLEZ



MLKKWQLKDVILL^FI^IFFGGVFVGSGYVYNILSLLLTPLGLQAFANEILFGLWCMAAPIAAIFVPRVGSATIGEVLAA 
LAEVLYGSQFGLGALLSGFVQGLGSEFGFIVTKNRYESWLSLTANSIGITLVSFVYEYIKLGYYAFSLPFVLSLLVVRF1SV 
YFFCTILVRAIVKLYHQFATGGKAZ



MVKVATQTPIISLFU.ILSLETSFIPSIALTLSVVAFCILFMLYYRRFKMLAWMHLAILPSFANYWAVQLHGDASQAVML 
GTRAFVTVCIGLVFVSSVSLKELLLYLAQKGLSRSWSYALIVVFNSFPLIQQEIKSLKEACLLRGQELHFWSPLIYSKVLM 
TVFRWRHLYLRALSAHGYDEHAQLKNSYRTFYIPKKTKLIYLLFFLLLQTSLFLZ



MRKHQLQVHKLTIl^MMIALDVVLTPIFRIEGMAPMSSVVNiLAGIMMGPVYALAMATVTAFIRMTTQGIPPLALTGAT 
FGALLAGLFYKYGRKFHYSALGEILGTGIIGSIVSYPVMVLFTGSAAKLSWFIYTPRFFGATLIGTAISFIAFRFLIKQEFFK 
KVQGYFFSERIDZ



MQEFTNPFPIGSSSLIHCITNEISCEMLANGILALGCKPVMADDSREVLDFTKQSQALFINLGHLSAEKEKAIRMAASYAN 
QSSLPMVVDAVGVTTSSIRKSLVKDLLDYRPTVLKGNMSEIRSLVGLKHHGVGVDASAKDQETEDLLQVLKDWCQTYP 
GMSFLVTGPKDLVVSKNQVAVLGNGCTELDWITGTGDLVGALTAVFLSQGKTGFEASCLAVSYLNIAAEKIVVQGMG
LEEFRYQVLNQLSLLRRDENWLDTIKGEVYEZ 



MNHKIAILSDVHGNATALEAVIADAKNQGASEYWLLGDIFLPGPGANDLVALLKDLPITASVRGNWDDRVLEALDGQ 
YGLEDPQEVQLLRMTQYLMERMDPATIVWLRSLPLLEKKEIDGLRFSISHNLPDKNYGGDLLVENDTEKFDQLLDAET
DVAVYGHVHKQLLRYGSQGQQIINPGSIGMPYFNWEALKNHRSQVAVIEVEDGELLNIQFRKVAYDYEAELELAKSKG
LPFIEMYEELRRDDNYQGHNLELLASLIEKHGYVEDVKNFFDFLZ 

MNVNQIVRIIPTLKANNRKLNETFYIETLGMKALLEESAFLSLGDQTGLEKLVLEEAPSMRTRKVEGRKKLARLIVKVE
NPLEIEGILSKTDSIHRLYKGQNGYAFE1FSPEDDLILIHAEDDIASLVEVGEKPEFQTDLASISLSKFEISMELHLPTDIESF 
LESSEIGASLDFIPAQGQDLTVDNTVTWDLSMLKFLVNELDIASLRQKFESTEYFIPKSEKFFLGKDRNNVELWFEEVZ 

MKWTKIIKKIEEQIEAGIYPGASFAYFKDNQWTEFYLGQSDPEHGLQTEAGLVYDLASVSKVVGVGTVCTFLWEIGQLD
IDRLVIDFLPESDYPDITIRQLLTHATDLDPFIPNRDLLTAPELKEAMFHLNRRSQPAFLYSDVHFLLLGFILERIFNQDLD 
VILKDQVWKPWGMTETKFGPVELAVPTVRGVEAGIVHDPKARLLGRHAGSAGLFSTIKDLQIFLEHYLADDFARDLNQ 
NFSPLDDKERSLAWNLEGDWLDHTGYTGTFIMWNRQKQEATIFLSNRTYEKDERAQWILDRNQVMNLIRKEEZ 


MMKKTYNHILVWGVIFYSICIVCFCFTPQEQSTVGVGTPGIQHLGRLVFLLTPFNSLWKLGEVSDIGQLCWIFLQNILNV 
FLFFPLIFQLLYLFPNLRKTKKVLLFSFLVSLGIECTQLILDFFFDFNRVFEIDDLWTNTLGGYLAWLLYKRLHKNKVRN 
Z 


MKIPLLTFARHKFVYVLLTLLFLALVYRDVLMTYFFFDIHAPDLAKFDGQAIKNDLLKSALDFRILQFNLGFYQSFIIPIII 
VLLGFQYIELKNKVLRLSIGREVSYQGLKRKLTLQVASIPCLIYLVTVLIIAIITYFFGTFSPLGWNSLFSDGSGLQRLLDGE 
DCSYLFFTCVLLIGIFINAIYFLQIVDYVGNVTRSAITYLMFLWLGSiMLLYSALPYYMVPMTSLMQASYGDVSLMKLFTP 
YILYIVPYMVLEKYEDNVZ 


MFKVLQKVGKAFMLPIAILPAAGLLLGIGGALSNPTTIATYPILDNSIFQSIFQVMSSAGEVVFSNLSLLLCVGLaGLAKR 
DKGTAALAGVTGYLVMTATIKALVKLFMAEGSAIDTGVIGALVVGIVAVYLHNRYNNIQLPSALGFFGGSRFVPIVTSF 
SSILIGFVFFVtWPPFQQLLVSTGGYISQAGPIGTFLYGFLMRLSGAVGLHHIIYPMFWYTELGGVETVAGQTVVGAQKIF 
FAQLADLAHSGLFTEGTRFFAGRFSTMMFGLPAACLAMYHSVPKNRRKKYAGLFFGVALTSFITGITEPIEFMFLFVSPV
LYVVHAFLDGVSFFIADVLNISIGNTFSGGVIDFTLFGILQGNAKTNWVLQIPFGLIWSVLYYIIFRWFITQFNVLTPGRGE 
EVDSKEISESADSTSNTADYLKQDSLQIIRALGGSNNIEDVDACVTRLRVAVKEVNQVDKALLKQIGAVDVLEVKGGIQ 
AIYGAKAILYKNSINEILGVDDZ 

MKFRKLACTVLAGAAVLGLAACGNSGGSKDAAKSGGDGAKTEITWWAFPVFTQEKTGDGVGTYEKSirEAFEKANPDI
KVKLETIDFKSGPEKITTAIEAGTAPDVLFDAPGRIIQYGKNGKLAELNDLFTDEFVKDVNNENIVQASKAGDKAYMYPI 
SSAPFYMAMNKKMLEDAGVANLVKEGWTTDDFEKVLKALKDKGYTPGSLFSSGQGGDQGTRAFISNLYSGSVTDEKV 
SKYTTDDPKFVKGLEKATSWIKDNLINNGSQFDGGADIQNFANGQTSYTILWAPAQNGIQAKLLEASICVEVVEVPFPSD 
EGKPALEYLVNGFAVFNNKDDKKVAASKKFIQFIADDKEWGPKDVVRTGAFPVRTSFGICLYEDKRMETISGWTQYYSP 

YYNTIDGFAEMRTLWFPMLQSVSNGDEKPADALKAFTEKANETtKKAMKQZ

MQSTEKKPLTAFTVISTnLLLLTVLFIFPFWILTGAFKSQPDTIVIPPQWFPKMPTMENFQQLMVQNPALQWMWNSVFI 
SLVTMFLVCATSSLAGYVLAKKRFYGQRILFAIFIAAMALPKQVVLVPLVRIVNFMGIHDTLWAVILPLIGWPFGVFLM 
KQFSENIPTELLESAKIDGCGEIRTFWSVAFPIVKPGFAALAIFTFINTWNDYFMQLVMLTSRNNLTISLGVATMQAEMA
TNYGLIMAGAALAAVPIVTVFLVFQKSFTQGITMGAVKGZ 

MKIMFKNFNNILLNRKIVLLLRIVLMMIUNHLl^TAVQKQDAVIFFTCRELISIFSYNDYSEANLEIPKLLLNl^LFMVGW 
LSVILLESDLADHYHHLIRYQSSSFFDYTRKRLVVISKFFTQDLFVWFLGLLPLGIHFKTVALFFLLAQLMMLYLLLSYLI
ALISAGAGFSFFLYFLAFVGQEWMMDHIVTVYLVLLSLLVMLIVSRLEEKFKKGZ 

MGKGEMGKGVIGLEFDSEVLVNKAPTLQLANGKTATFLTQYDSKTLLFAVDKEDIGQEIIGIAKGSIESMHNLPVNLAG 
ARVPGGVNGSKAAVHEVPEFTGGVNGTEPAVHEIAEYKGSDSLVTLTTKKDYTYKAPLAQQALPETGNKESDLLASLG
LTAFFLGLFTLGKKREQZ 



MKKTFFLLVLGLFCLLPLSVFAIDFKINSYQGDLYIHADNTAEFRQKIVYQFEEDFKGQIVGLGRAGKMPSGFDIDPHPKI
QAAKNGAELADVTSEVTEEADGYTVRVYNPGQEGDIVEVDLVWNLKNLLFLYDDIAELNWQPLTDSSESIEKFEFHVR 
GDKGAEKLFFHTGKLFREGTIEKSNLDYTIRLDNLPAKRGVELHAYWPRTDFASARDQGLKGNRLEEFNKIEDSIVREK 
DQSKQLVTWVLPSILSISLLLSVCFYFIYRRKTTPSVKYAKNHRLYEPPMELEPMVLSEAVYSTSLEEVSPLVKGAGKFTF 
DQLIQATLLDVIDRGNVSI1SEGDAVGLRLVKEDGLSSFEKDCLNLAFSGKKEETLSNLFADYKVSDSLYRRAKVSDEKR 

IQARGLQLKSSFEEVLNQMQEGVRKRVSFWGLPDYYRPLTGGEKALQVGMGALTILPLFIGFGLFLYSLDVHGYLYLPL






PILGFLGLVLSVFYYWKLRLDNRDGVLNEAGAEVYYLWTSFENMLREIARLDQAELESIVVWNRLLVYATLFGYADK 
VSHLMKVHQIQVENPDINLYVAYGWHSTFYHSTAQMSHYASVANTASTYSVSSGSGSSGGGFSGGGGGGSIGAFZ 



MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGTTVSQRTEAQTG
EAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTPYQIIKVDGS 
EKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTVYEQKDKSVPLDVVILLDNSNSMSNIRNKNA 
RRAERAGEATRSLIDKITSDSENRVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNYDQTSFTTNTKDYSYUCL 
TNDKNDIVELKNKVPTEAEDHDGNRLMYQFGATFTQKALMKADEILTQQARQNSQKVIFHITDGVPTMSYPINFNHAT 

FAPSYQNQLNAFFSKSPNKDGILLSDFITQATSGEHTIVRGDGQSYQMFTDKTVYEKGAPAAFPVKPEKYSEMKAAGYA
VIGDPINGGYIWLNWRESILAYPFNSNTAKITNHGDPTRWYYNGNIAPDGYDVFTVGIGINGDPGTDEATATSFMQSISS 
KPENYTWTDTTKILEQLNRYFHTIVTEKKSIENGTITDPMGELIDLQLGTDGRFDPADYTLTANDGSRLENGQAVGGP 
QNDGGLLKNAKVLYDTTEKRIRVTGLYLGTDEKVTLTYNVRLNDEFVSNKFYDTNGRTTLHPKEVEQNTVRDFPIPKI 
RDVRKYPEITISKEKKLGDIEFIKVNKNDKKPLRGAVFSLQKQHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDG 

KYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVTSIVPQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIG
CMMMGGVLLYTRKHPZ 



MKSINKFLTMLAALLLTASSLFSAATVFAAGTTTTSVTVHKLLATDGDMDKIANELETGNYAGNKVGVLPANAKEIAG 
VMFVWTm'NNEIIDENGQTLGVNIDPQTFKLSGAMPATAMFCKLTEAEGAKFNTANLPAAKYKIYEIHSLSTYVGEDGA
TLTGSKAVPIEIELPLNDVVDAHVYPKNTEAKPKIDKDFKGKANPDTPRVDKDTPVNHQVGDVVEYEIVTKIPALANYA 
TANWSDRMTEGLAFNKGTVKVTVDDVALEAGDYALTEVATGFDLKLTDAGLAKVNDQNAEKTVKITYSATLNDKAI 
VEVPESNDVTFNYGNNPDHGNTPKPNKPNENGDLTLTKTWVDATGAPIPAGAEATFDLVNAQTGKVVQTVTLTTDKN 
TVTVNGLDKNTEYKFVERSIKGYSADYQEITTAGEIAVKNWKDENPKPLDPTEPKVVTYGKKFVKVNDKDNRLAGAEF 
V1ANADNAGQYLARKADKVSQEEKQLVVTTKDALDRAVAAYNALTAQQQTQQEKEKVDKAQAAYNAAVIAANNAF
EWVADKDNENVVKLVSDAQGRFEITGLLAGTYYLEETKQPAGYALLTSRQKFEVTATSYSATGQGIEYTAGSGKDDAT 
KVVNKKmPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLAZ 



MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV
QrVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAKKTDTM 
TTKVKLIKVDQDHNRLEGVGFKLVSVARDVSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIFVTNLPLGNYRFKEVEPL 
AGYAVTTLDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVLQNGKEVVVTS 
GKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGEETLVYLDACCHFV 

VWZ



MSHIYLSIFTSLLLMLGLVNVAQADEYLRIGMEAAYAPFNWTQDDDSNGAVKIDGTNQYANGYDVQIAKKIAKDLGKE 
PLVVKTKWEGLVPALTSGKIDMIIAGMSPTAERKQEIAFSSSYYTSEPVLLVKKDSAYASAKSLDDFNGAKITSQQGVYL 

YNLIAQIPGAKKETAMGDFAQMRQALEAGVIDAYVSERPEALTAEAANSKFKMIQVEPGFKTGEEDTAIAIGLRKNDNR
ISQINASIETISKDDQVALMDmiKEQPAEATTTEETSSSFFSQVAKILSENW<^LLRGAGrTLLISIVGTIIGLHGLAIGVFR 
TAPl^E^VIYGLQKLVGWVLNVYIEIFRGTPMIVQSMVIYYGTAQAFGINLDRTLAAlFIVSIhfTGAYMTEIVRGGILAV 
DKGQFEAATALGMTHNQTMRKIVLPQVVRNILPATGNEFVINIKDTSVLNVISVVELYFSGNTVATQTYQYFQTFTI1AV 

IYFVLTFTVTRILRFIERRMDMDTYTTGANQMQTEDLKZ

MTQAILEIKHLKKSYGQNEVLKDISLTVHKGEVISIIGSSGSGICSTFLRSINLLETPTDGQILYHGQNVLEKGYDLTQYREK 
LGMVFQSFNLFENLNVLENTIVAQTTVLKRERTEAEKIAKENLEKVGMGERYWQAKPKQLSGGQKQRVAIARALSMN 
PDAILFDEPTSALDPEMVGEVLKIMQDI^QEGLTMIVVTHEMEFARDVSHRVIFMDKGVIAEEGKPEDLFTNPKEDRTK 
EFLQRYLKZ



MKKYQLLFKISAVFSYLFFVFSLSQLTLIVQNYWQFSSQIGNLFWIQNILSLLFIGVMIVVLVKTGHGYLFRIPRKKWLW 
YSILTVLVLVFQtSFNVQTAKHVQSTAEGWAVLIGYSGTNFAELGIYlALFFLVPLMEELIYRGLLQHAFFFCHSRFGLDLL 
LPSILFALPHFSSLPSLLDIFVFATVGIIFAGLTRYTKSIYPSYAVHVINNIVATFPFLLTFLHRVLGZ



MNKKQWLGLGLVAVAAVGLAACGNRSSRNAASSSDVKTKAAIVTDTGGVDDKSFNQSAWEGLQAWGKEHNLSKDN 
GFTYFQSTSEADYANNLQQAAGSYNLIFGVGFALNNAVKDAAKEHTDLNYVLIDDVIKDQKNVASVTFADNESGYLA 
GVAAAKTTKTKQVGFVGGIESEVISRFEAGFKAGVASVDPSIKVQVDYAGSFGDAAKGKTIAAAQYAAGADIVYQVAG
GTGAGVFAEAKSLNESRPENEKVWVIGVDRDQEAEGKYTSKDGKESNFVLVSTLKQVGTTVKDISNKAERGEFPGGQV 
IVYSLKDKGVDLAVTNLSEEGKKAVEDAKAKILDGSVKVPEKZ 
MSKKLQQISVPLISVFLGILLGAIVMWIFGYDAIWGYEELFYTAFGSLRGIGEIFRAMGPLVLIGLGFAVASRAGFFNVGL
PGQALAGWII^GWFAUHPDMPRPLMILATIVIALUGGIVGAIPGILRAYLGTSEVIVTIMMNYIVLYVGNAFIHAFPKD 
FMQSTDSTIRVGANATYQTPWLAELTGNSRMNIGIFFAIIAVAVIWFMLKKTTLGFEIRAVGLNPHASEYAGISAKRTIIL 
SMIISGALAGLGGAVEGLGTFQNVYVQGSSLAIGFNGMAVSLLAANSPIGILFAAFLFGVLQVGAPGMNAAQVPSELVSI 
VTASIIFFVSVHYLIERFVKPKKQVKGGKZ

MGVKKKLKLTSLLG^LLIMTACATNGVTSDITAESADFWSKLVYFFAEIIRFLSFDISIGVGIILFTVLIRTVLLPVFQVQ 
MVASRKMQEAQPRIKALREQYPGRDMESRTKLEQEMRKVFKEMGVRQSDSLWPILIQMPVILALFQALSRVDFLKTGH 
FLWINLGSVDTTLVLPILAAVFTFLSTWl^NKAI^ERNGATTAMMYGIPVLIFIFAVYAPGGVALYWTVSNAYQVLQTY
FLNNPFKIIAEREAVVQAQKDLENRKRKAKKKAQKTKZ 

MVIDPFAINELDYYLVSHFHSDHIDPYTAAAILNNPKLEHVKFIGPYHCGRIWEGWGVPKERIIVVKPGDTIELKDMKIH 
AVESFDRTCLVTLPVNGADETGGELAGLAVTDEEMAQKAVNYIFETPGGTIYHGADSHFSNYFAKHGKDFKIDVALNN
YGENPVGIQDKMTSIDLLRMAENLRTKVIIPVHYDIWSNFMASTNEILELWKMRKDRLQYDFHPFIWEVGGKYTYPQD 
QHLVEYHHPRGFDDCFEQDSNIQFKALLZ 

MFLSGWLSSFANTYIHDLLGVLFPDSPFLNAFESAIAAPLVEEPLKLLSLVFVLALIPVRKLKSLFLLGIASGLGFQMIKDI
GYIRTDLPEGFDFTISRILERIISGIASHWTFSGLAVVGVYLLYRAYKGQKVGKKQGLIFLGLALGTHFLFNSPFVELETEL 
PLAIPVVTAIALYGFYHAYCFVEKHNELMTZ 

MKVEPRCDVI^RMSHFFIRILIMELQELVERSWAIRQAYHELEVKHHDSKWTVEEDLLALSNDIGNFQRLVMTKQGRY
YDETPYTLEQKLSENIWWLLELSQRLDIDILTEMENFLSDKEKQLNVRTWKZ 

MLDWKQFFLAYLRSRSRLFIYLLSLAFLVLLFQFLFASLGIYFLYFFFLCCFVTILFFTWDILVETQVYRQELLYGEREAK 
SPLEIALAEKLEAREMELYQQRSKAERKLTDLLDYYTLWVHQIKTPIAASQLLVAEVVDRQLKQQLEQEIFKIDSYTNLV
LQYLRLESFHDDLVLKQVQIEDLVKEIIRKYALFFIQKGLNVNLHDLDKEIVTDKKWLLVVIEQIISNSLKYTKEGGLEIY 
MDDQELCIKDTGIGIKNSDVLRVFERGFSGYNGRLTQQSSGLGLYLSKKISEELGHQIRIESEVGKGTTVRIQFAQVNLVL 
EZ 


MEL^HNAEILI^AANKSHYPQDELPEIALAGRSNVGKSSFI^MLNRKNLARTSGKPGKTQLLNFFNIDDKMRFVDVP 
GYGYARVSKKEREKWGCMIEEYLTTRENLRAVVSLVDLRHDPSADDVQMYEFLKYYE1PVIIVATKADKIPRGKWNKH 
ESAIKKKLNFDPSDDFILFSSVSKAGMDEAWDAILEKLZ 


MTKKQLHLVIVTGMSGAGKTVAIQSFEDLGYFTIDNMPPALLPKFLQLVEIKEDNPKLALVVDMRSRSFFSEIQAVLDEL 
ENQDGLDFKILFLDAADKELVARYKETRRSHPLAADGRILDGIKLERELLAPLKNMSQNVVDTTELTPRELRKTLAEQF 
SDQEQAQSFRIEVMSFGFKYGIPIDADLVFDVRFLPNPYYLPELRNQTGVDEPVYDYVMNHPESEDFYQHLLALIEPILP 
SYQKEGKSVLTIAMGCTGGQHRSVAFAKRLAQDLSKNWSVNEGHRDKDRRKETVNRSZ 


MRKPKITVIGGGTGSPVrLKSLREKDVElAAIVTVADDGGSSGELRKNMQQLTPPGDLRNVLVAMSDMPKFYEKVFQYR 
FSEDAGAFAGHPLGNLIIAGLSEMQGSTYNAMQLLSKFFHTTGKIYPSSDHPLTLHAVFQDGTEVAGESHIVDHRGIIDN 
VYVTNALNDDTPLASRRVVQTILESDMIVLGPGSLFTSILPNIVIKEIGRALLETKAEIAYVCNIMTQRGETEHFTDSDHV 
EVLHRHLGRPFIDTVLVNIEKVPQEYMNSNRFDEYLVQVEHDFVGLCKQVSRVISSNFLRLENGGAFHDGDLIVDELMR

IQVKKZ 

MKNLIKLLIIRLIVNLADSVFYIVALWHVSNNYSSSMFLGIHAVNYLPDLLLIFFGPVIDRVNPQKILIISILVQLAVAVIFL
LLLNQISFWVIMSLVFISVMASSISYVIEDVLIPQVVEYDKIVFANSLFSISYKVLDSIFNSFASFLQVAVGFILLVKIDIGIFL 
LALFILLLLKFRTSNANIENFSFKYYKREVLQGTKFILNNKLLFKTSISLTLINFFYSFQTVVVPIFSIRYFDGPIFYGIFLT1A 
GLGGILGNMLAPIVIKYLKSNQIVGVFLFLNGSSWLVAIVIKDYTLSLILFFVCFMSKGVFNIIFNSLYQQIPPHQLLGRVN 
TTIDSIISFGMPIGSLVAGTLIDLNIELVLIAISIPYFLFSYIFYTDNGLKEFSIYZ 


MMSNKNKEILIFAILYTVLFMFDGVKLLASLMPSAIANYLVYVVLALYGSFLFKDRLIQQWKEIRKTKRKFFFGVLTGW 
LFLILMTVVFEFVSEMLKQFVGLDGQGLNQSNIQSTFQEQPLLIAVFACVIGPLVEELFFRQVLLHYLQERLSGLLSIILV 
GLVFALTHMHSLALSEWIGAVGYLGGGLAFSIIYVKEKENIYYPLLVHMLSNSLSLIILAISIVKZ 

LKKPIIEFKNVSKVFEDS^KVLKDINFELEEGKFYTLLGASGSGKSTILNIIAGLLDATTGDIMLDGVRINDIPTNKRDVH
TVFQSYALFPHMNVFENVAFPLIUJIKIDKKEIEQRVAEVLKMVQLEGYEKR5IRKLSGGQRQRVAIARAIINQPRVVLLD 
EPLSALDLKLRTDMQYELRELQQRLGITFVFVTHDQEEALAMSDWrFVMNDGEIVQSGTPVDIYDEPINHFVATFIGESN 
ILPGTMIEDYLVEFNGKRFEAVDGGMKPNEPVEVVIRPEDLRITLPEEGKLQVKVDTQLFRGVHYEIIAYDELGNEWMI
HSTRKAJVGEEIGLDFEPEDIHIMRLNETEEEFDARIEEYVEIEEQEAGLINAIEEERDEENKLZ 



MKSMRILFLLAUQISLSSCFLWKECILSFKQSTAFFIGSMVFVSGICAGVNYLYTRKQEVHSVLASKKSVKLFYSMLLLIN 
LLGAVLVI^DNLFIK^LQQELVDFLLPSFFFLFGLDLUFLPLKKYVRDFLAMLDRKKTVLVTILATLLFLRNPMTIVSL
LIYIGLGLFFAAYLVPNSVKKEVSFYGHIFRDLVLVIVTLIFFZ 



MVKKIIGMVLALI^VTVVGVGVFAYTIYQQGTETLAKTYKKIGEETKVIEATEPLTILLMGVDTGNVERTETWVGRSDS 
MILMTVNPKTKKTTMMSLERDILTRIESGNGQAHEAKLNSAYADGGAELAIETIQKMMNIHIDRYVMVNMRGLQKLV
DAVGGITVNNILGFPISISDQEEFNT1SIGVGEQHIGGEEALVYARMRYQDPEGDYGRQKRQREVIQKVMEKALSLNSIGH 
YQEILKAL^DNMQTNIDLSAKSIPNLLGYKDSFKTIETQQLQGEGEILQGVSYQIVSRAHMLEMQhTLLRRSLGQEEVTQL 
ETNAVLFEDLFGRAPVGDEDNZ 


MKKQAYVHALTSFLFVFFFSHSLLEILDFDWSIFLHDVEiaEKFVFLLLVFSMSMTCLLALFWRGIEELSLRKMQANLK 
RLLAGQEVVQVADPDLDASFKSLSGKLNLLTEALQKAENQSLAQEEEIIEKERKRIARDLHDTVSQELFAAHMILSGISQ 
QALKLDREKMQTQLQSVTAILETAQKDLRVLLLHLRPVELEQKSLIEGIQILLKELEDKSDLRVSLKQNMTKLPKKIEEHI 
FRILQELISNTLRHAQASCLDVYLYQTDVELQLKVVDNGIGFQLGSLDDLSYGLRNIKERVEDMAGTVQLLTAPKQGLA 
VDIRIPLLDKEZ



MIVSIISQGFVWAILGLGIFMTFRILNFPDMTTEGSFPLGGAVAVTLrTKGVNPFLATLVAVGAGCLAGMAAGLLYTKGK 
IPTLLSGILVMTSCHSIMLLIMGRANLGLLGTKQIQDVLPFDSDLNQLLTGLIFVSIVIALMLFFLDTKLGQAYIATGDNP 
DMARSFGIHTGRMELMGLVLSNGVIALAGALIAQQEGYADVSRGIGVIVVGLASLIIGEVIFKSLSLAERLVTrVVGSIAY
QFLVWAVIALGFNTSYLRLYSALILAVCLMIPTFKQTILICGAKLSKZ 



MKKMKVWSTVLATGVALTTLAACSGGSNSTTASSSEEKADKSQELVIYSNSVSNGRGDWLTAKAKEAGFNIKMVDIAG 
AQLADRVIAEKNNAVADMVFGIGAVDSNKIRDQKLLVQYKPKWLDKIDQSLSDKDNYYNPVIVQPLVLiGAPDVKEMP
KDWTELGSKYKGKYSISGLQGGTGRAILASILVRYLDDKGELGVSEKGWEVAKEYUCNAYTLQKGESSIVKMLDKEDPI 
QYGMMWGSGALVGQKEQNVVFKVMTPEIGVPFVTEQTMVLSTSKKQALAKEFIDWFGQSE[QVEYSKNFGSIPANKD 
ALKDLPEDTKKFVDQVKPQNIDWEAVGKHLDEWVEKAELEYVQZ 


MIKFDNIQIKYGDFVAIDNLNLDIHEGEFFTFLGPSGCGKSTTLRALVGFLDPSSGSIEVNGTDVTHLEPEKRGIGIVFQSY 
ALFPTMTVFDNIAFGLKVKKVAPDVIKAKVSAVAAKIKISDQQLQRNVSELSGGQQQRVALARALVLEPKILCLDEPLS 
NLDAKLRVDLRKELKRLQKELGITTLYVTHDQEEALTLSDR1AVFNNGYIEQVGTPVEIYHNSQTEFVCDFIGDINVLTD 
ETVHEVLLKNTSVFLEDKKGYIRLEKVRFNRETEQDFILKGTIIDVEFSGVTIHYTIKVSESQILNVTSIDSQAAIR5VGESV 
ELFITPSDVLQFZ



MRHKLNLKDWLIRLGLIWFLVTniYPNFDLVVNVFVKGGEFSLDAVHRVLKSQRALQSIMNSFKLAFSUITVNVVGIL 
CVLFTEYFDIKGAKILKLGYNTTSLIYGGVVLATGYKFVYGPYGLrrKFLQNVIPSLDPNWFIGYGAVLFIMTFSGTANHT 
LFLTrn*IRSVDYHTIEAARNMGAKPFTVFRKVVLPTLIPTLFALTrMVFl^GL^AVAAPMIVGGKEFQTINPMriTFAGMG
NSRDLAALLAIILGIATTILLTIMNKIEKGGNYIS1SKTKAPLKKQKIASKPWNIIAHIVAYGLFTVFMLPLIFIVLYSFTDPV 
AIQTGNLTLSNFTLENYRLFFSNSAAFSPFLVSF1YSHAATTATILAVVFARVVRKHKSRFDFLFEYGALLPWLLPSTLLA 
VSLLFTFNQPQFLVLNQILVGSLVILLIAYIVVKIPFSYRMVRAILFSVDDEMEDAARSMGASPFYTMMKVIIPFILPVVLS 
VIALNFNSLLTDFDLSVFLYHPLAQPLGITIRSAGDETATSNAQALVFVYTIVLMIISGTVLYFTQRPGRKVRKZ 
Table 3

ID201 - 4106.4 

ATGATAAAAAATCCTAAATTATTAACCAAGTCTTTTTTAAGAAGTTTTGCAATTCTAGGTGGTGTTGGTCTAGTCAT
TCATATAGCTATTTATTTGACCTTTCCTTTTTATTATATTCAACTGGAGGGGGAAAAGTTTAATGAGAGCGCAAGAG
TGTTTACGGAGTATTTAAAGACTAAGACATCTGATGAAATTCCAAGCTTACTCCAGTCTTATTCAAAGTCCTTGACC 
ATATCTGCTCACCTTAAAAGAGATATTGTAGATAAGCGGCTCCCTCTTGTGCATGACTTGGATATTAAAGATGGAAA
GCTATCAAATTATATCGTGATGTTAGATATGTCTGTTAGTACAGCAGATGGTAAACAGGTAACCGTGCAATTTGTTC

ACGGGGTGGATGTCTACAAAGAAGCAAAGAATATTTTGCTTTTGTATCTCCCATATACATTTTTGGTTACAATTGCT
TTTTCCTTTGTTTTTTCTTATTTTTATACTAAACGCTTGCTCAATCCTCTTTTTTACATTTCAGAAGTGACTAGTAA 
AATGCAAGATTTGGATGACAATATTCGTTTTGATGAAAGTAGGAAAGATGAAGTTGGTGAAGTTGGAAAACAGATTA 
ATGGTATGTATGAGCACTTGTTGAAGGTTATTTATGAGTTGGAAAGTCGTAATGAGCAAATTGTAAAATTGCAAAAT 
CAAAAGGTTTCCTTTGTCCGCGGAGCATCACATGAGTTGAAAACCCCTTTAGCCAGTCTTAGAATTATCCTAGAGAA

TATGCAGCATAATATTGGAGATTACAAAGATCATCCAAAATATATTGCAAAGAGTATAAATAAGATTGACCAGATGA
GCCACTTATTAGAAGAAGTACTGGAGTCTTCTAAATTCCAAGAGTGGACAGAGTGTCGTGAGACCTTGACTGTTAAG 
CCAGTTTTAGTAGATATTTTATCACGTTATCAAGAATTAGCTCATTCAATAGGTGTTACAATTGAAAATCAATTGAC 
AGATGCTACCAGGGTCGTCATGAGTCTTAGGGCATTGGATAAGGTTTTGACAAACCTGATTAGTAATGCAATTAAAT 
ATTCAGATAAAAATGGGCGTGTAATCATATCCGAGCAAGATGGCTATCTCTCTATCAAAAATACATGTGCGCCTCTA 

AGTGACCAAGAACTAGAACATTTATTTGATATATTCTATCATTCTCAAATCGTGACAGATAAGGATGAAAGTTCCGG
TTTGGGTCTTTACATTGTGAATAATATTTTAGAAAGCTATCAAATGGATTATAGTTTTCTCCCTTATGAACACGGTA 
TGGAATTTAAGATTAGCTTGTAG



MIKNPKLLTKSFLRSFAILGGVGLVIHIAIYLTFPFYYIQLEGEKFNESARVFTEYLKTKTSDEIPSLLQSYSKSLT
ISAHLKRDIVDKRLPLVHDLDIKDGKLSNYIVMLDMSVSTADGKQVTVQFVHGVDVYKEAKNILLLYLPYTFLVTIA
FSFVFSYFYTKRLLNPLFYISEVTSKMQDLDDNIRFDESRKDEVGEVGKQINGMYEHLLKVIYELESRNEQIVKLQN 
QKVSFVRGASHELKTPLASLRIILENMQHNIGDYKDHPKYIAKSINKIDQMSHLLEEVLESSKFQEWTECRETLTVK 
PVLVDILSRYQELAHSIGVTIENQLTDATRVVMSLRALDKVLTNLISNAIKYSDKNGRVIISEQDGYLSIKNTCAPL

SDQELEHLFDIFYHSQIVTDKDESSGLGLYIVNNILESYQMDYSFLPYEHGMEFKISLZ

ID202 - 4106.9 

ATGGATAAAATTATTAAAACTATATCAGAAAGCGGAGCCTTTCGTGCTTTTGTCCTTGATAGCACTGAAACCGTCCG 
CACTGCTCAAGAAAAACATCAAACCC^GCTAGCTCAACTGTAGCGCTTGGTCGAACTCTTATCGCTAGCCAGATTC
TCGCAGCCAATGAAAAAGGAAATACCAAACTTACAGTTAAGGTGTTGGGATCTAGCTCTCTAGGTGCTATTATCACC 
GTCGCTGATACCAAGGGGAACGTCAAAGGCTATGTTCAAAATCCTGGTGTTGACATCAAAAAGACTGCGACTGGTGA 
AGTCCTAGTCGGACCTTTTGTTGGAAATGGTCAATTCCTCGTTATCACAGACTACGGTACTGGAAATCCTTACAACT 
CTATAACTCCCCTCATCTCTGGAGAAATCGGTGAAGACCTTGCCTTTTACCTTACTGAAAGCCAACAAACGCCTTCA 
GCGGTCGGCCTCAATGTCCTTTTGGACGAGGAAGACAAGGTCAAGGTTGCAGGTGGTTTCCTAGTTCAAGTCTTGCC
AGGAGCCAAGAAAGAAGAGATTGCTCGCTTTGAAAAACGCATCCAAGAAATGCCAGCTATCTCTACTCTTCTCGAAA 
GCGACGACCATATCGAAGCCCTCCTCAAGGCTATCTACGGGGACGAAGCCTACAAGCGTCTTTCTGAAGAAGAAATC 
CGTTTCCAATGTGACTGTAGCCATGAACGCTTTATGAACGCTCTTGCCAGCCTTCCAAGCTCAGACTTACAGGAAAT 
GAAAGAGGAAGACCACGGGGCAGAAATCACTTGTCAATTCTGCCAAACTACTTACAACTTTGATGAAAAGGACCTGG
AGGAACTCATTCGTGACAAATCTTAA



MDKIIKTISESGAFRAFVLDSTETVRTAQEKHQTQASSTVALGRTLIASQILAANEKGNTKLTVKVLGSSSLGAIIT
VADTKGNVKGYVQNPGVDIKKTATGEVLVGPFVGNGQFLVITDYGTGNPYNSITPLISGEIGEDLAFYLTESQQTPS 
AVGLNVLLDEEDKVKVAGGFLVQVLPGAKKEEIARFEKRIQEMPAISTLLESDDHIEALLKAIYGDEAYKRLSEEEI
RFQCDCSHERFMNALASLPSSDLQEMKEEDHGAEITCQFCQTTYNFDEKDLEELIRDKSZ 



ID203 - 4115 


ATGAAATCAATAACTAAAAAGATTAAAGCAACTCTTGCAGGAGTAGCTGCCTTGTTTGCAGTATTTGCTCCATCATT 
TGTATCTGCTCAAGAATCATCAACTTACACTGTTAAAGAAGGTGATACACTTTCAGAAATCGCTGAAACTCACAACA 
CAACAGTTGAAAAATTGGCAGAAAACAACCACATTGATAACATTCATTTGATTTATGTTGATCAAGAGTTGGTTATC 
GATGGCCCTGTAGCGCCTGTTGCAACACCAGCGCCAGCTACTTATGCGGCACCAGCCGCTCAAGATGAAACTGTTTC 
AGCTCCAGTAGCAGAAACTCCAGTAGTAAGTGAAACAGTTGTTTCAACTGTAAGCGGATCTGAAGCAGAAGCCAAAG
AATGGATCGCTCAAAAAGAATCAGGTGGTAGTATACAGCTACAAATGGACGTTATATCGGACGTTACCAATTAA



MKSITKKIKATLAGVAALFAVFAPSFVSAQESSTYTVKEGDTLSEIAETHNTTVEKLAENNHIDNIHLIYVDQELVI
DGPVAPVATPAPATYAAPAAQDETVSAPVAETPVVSETVVSTVSGSEAEAKEWIAQKESGGSIQLQMDVISDVTNZ

ID204 - 4117.1 

ATGAATTTAGGAGAATTTTGGTACAATAAAATAAATAAGAACAGAGGAAGAAGGTTAATGAAGAAAGTAAGATTTAT
TTTTTTAGCTCTGCTATTTTTCTTAGCTAGTCCAGAGGGTGCAATGGCTAGTGATGGTACTTGGCAAGGAAAACAGT 
ATCTGAAAGAAGATGGCAGTCAAGCAGCAAATGAGTGGGTTTTTGATACTCATTATCAATCTTGGTTCTATATAAAA 
GCAGATGCTAACTATGCTGAAAATGAATGGCTAAAGCAAGGTGACGACTATTTTTACCTCAAATCTGGTGGCTATAT 
GGCCAAATCAGAATGGGTAGAAGACAAGGGAGCCTTTTATTATCTTGACCAAGATGGAAAGATGAAAAGAAATGCTT

GGGTAGGAACTTCCTATGTTGGTGCAACAGGTGCCAAAGTAATAGAAGACTGGGTCTATGATTCTCAATACGATGCT
TGGTTTTATATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTCCAAATTAAAGGGAAGGACTATTATTTCAA
ATCCGGTGGTTATCTACTGACAAGTCAGTGGATTAATCAAGCTTATGTGAATGCTAGTGGTGCCAAAGTACAGCAAG 
GTTGGCTTTTTGACAAACAATACCAATCTTGGTTTTACATCAAAGAAAATGGAAACTATGCTGATAAAGAATGGATT 
TTCGAGAATGGTCACTATTATTATCTAAAATCCGGTGGCTACATGGCAGCCAATGAATGGATTTGGGATAAGGAATC 

TTGGTTTTATCTCAAATTTGATGGGAAAATGGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACT

ACTTCAAATCCGGTGGTTACATGACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATCTGAT 
GGGAAAATAGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGGTGGTTACAT 
GACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTACCTCAAATCTGATGGGAAAATAGCTGAAAAAGAAT
GGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCTGGTGGCTACATGGCGAAAAATGAGACAGTAGAT

GGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGAGGAAAAACTACAAATGAAAATGCTGCTTACTATCAAGTAGT

GCCTGTTAC^GCC^ATGTTTATGATTCAGATGGTGAAAAGCTTTCCTATATATCGCAAGGTAGTGTCGTATGGCTAG 
ATAAGGATAGAAAAAGTGATGACAAGCGCTTGGCTATTACTATTTCTGGTTTGTCAGGCTATATGAAAACAGAAGAT
TTACAAGCGCTAGATGCTAGTAAGGACTTTATCCCTTATTATGAGAGTGATGGCCACCGTTTTTATCACTATGTGGC 
TCAGAATGCTAGTATCCCAGTAGCTTCTCATCTTTCTGATATGGAAGTAGGCAAGAAATATTATTCGGCAGATGGCC

TGCATTTTGATGGTTTTAAGCTTGAGAATCCCTTCCTTTTCAAAGATTTAACAGAGGCTACAAACTACAGTGCTGM
GAATTGGATAAGGTATTTAGTTTGCTAAACATTAACAATAGCCTTTTGGAGAACAAGGGCGCTACTTTTAAGGAAGC
CGAAGAACATTACCATATCAATGCTCTTTATCTCCTTGCCCATAGTGCCCTAGAAAGTAACTGGGGAAGAAGTAAAA
TTGCCAAAGATAAGAATAATTTCTTTGGCATTACAGCCTATGATACGACCCCTTACCTTTCTGCTAAGACATTTGAT
GATGTGGATAAGGGAATTTTAGGTGCAACCAAGTGGATTAAGGAAAATTATATCGATAGGGGAAGAACTTTCCTTGG 

AAACAAGGCTTCTGGTATGAATGTGGAATATGCTTCAGACCCTTATTGGGGCGAAAAAATTGCTAGTGTGATGATGA

AAATCAATGAGAAGCTAGGTGGCAAAGATTAG



MNLGEFWYNKINKNRGRRLMKKVRFIFLALLFFLASPEGAMASDGTWQGKQYLKEDGSQAANEWVFDTHYQSWFYIK

ADANYAENEWLKQGDDYFYLKSGGYMAKSEWVEDKGAFYYLDQDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDA
WFYIKADGQHAEKEWLQIKGKDYYFKSGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWI
FENGHYYYLKSGGYMAANEWIWDKESWFYLKFDGKMAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSD 
GKIAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVYDSHSQAWYYFKSGGYMAKNETVD 
GYQLGSDGKWLGGKTTNENAAYYQVVPVTANVYDSDGEKLSYISQGSVVWLDKDRKSDDKRLAITISGLSGYMKTED

LQALDASKDFIPYYESDGHRFYHYVAQNASIPVASHLSDMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAE

ELDKVFSLLNINNSLLENKGATFKEAEEHYHINALYLLAHSALESNWGRSKIAKDKNNFFGITAYDTTPYLSAKTFD
DVDKGILGATKWIKENYIDRGRTFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKDZ 

ID205 - 4118.1 

ATGAAAAAATTAGGTACATTACTCGTTCTCTTTCTTTCTGCAATCATTCTTGTAGCATGTGCTAGCGGAAAAAAAGA
TACAACTTCTGGTCAAAAACTAAAAGTTGTTGCTACAAACTCAATCATCGCTGATATTACTAAAAATATTGCTGGTG 
ACAAAATTGACCTTCATAGTATCGTTCCGATTGGGCAAGACCCACACGAATACGAACCACTTCCTGAAGACGTTAAG
AAAACTTCTGAGGCTAATTTGATTTTCTATAACGGTATCAACCTTGAAACAGGTGGCAATGCTTGGTTTACAAAATT 
GGTAGAAAATGCCAAGAAAACTGAAAACAAAGACTACTTCGCAGTCAGCGACGGCGTTGATGTTATCTACCTTGAAG

GTCAAAATGAAAAAGGAAAAGAAGACCCACACGCTTGGCTTAACCTTGAAAACGGTATTATTTTTGCTAAAAATATC

GCCAAACAATTGAGCGCCAAAGACCCTAACAATAAAGAATTCTATGAAAAAAATCTCAAAGAATATACTGATAAGTT
AGACAAACTTGATAAAGAAAGTAAGGATAAATTTAATAAGATCCCTGCTGAAAAGAAACTCATTGTAACCAGCGAAG
GAGCATTCAAATACTTCTCTAAAGCCTATGGTGTCCCAAGTGCTTACATCTGGGAAATCAATACTGAAGAAGAAGGA 
ACTCCTGAACAAATCAAGACCTTGGTTGAAAAACTTCGCCAAACAAAAGTTCCATCACTCTTTGTAGAATCAAGTGT 

GGATGACCGTCCAATGAAAACTGTTTCTCAAGACACAAACATCCCAATCTACGCTCAAATCTTTACTGACTCTATCG
CAGAACAAGGTAAAGAAGGCGACAGCTACTACAGCATGATGAAATACAACCTTGACAAGATTGCTGAAGGATTGGCA
AAATAA 



MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIVPIGQDPHEYEPLPEDVK
KTSEANLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSDGVDVIYLEGQNEKGKEDPHAWLNLENGIIFAKNI
AKQLSAKDPNNKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEEG
TPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGDSYYSMMKYNLDKIAEGLA 
KZ 

ID206 - 4119.1 

ATGGAATGGTATAAAAAAATCGGACTTCTTGCAACTACAGGTTTAGCTTTGTTTGGGCTCGGCGCTTGCTCCAACTA 
TGGTAAATCTGCGGATGGOVCAGTGACCATCGAGTATTTCAACCAGAAAAAAGAAATGACCAAAACCTTGGAAGAAA 
TCACTCGTGATTTTGAGAAGGAAAACCCTAAGATCAAGGTCAAAGTCGTCAATGTACCAAATGCTGGTGAAGTATTG
AAGACACGCGTTCTCGCAGGAGATGTGCCTGATGTGGTCAATATTTACCCACAGTCCATCGAACTGCAAGAATGGGC 
AAAAGCAGGTGTTTTTGAAGATTTGAGCAACAAAGACTACCTGAAACGCGTGAAAAATGGCTACGCTGAAAAATATG 
CTGTAAACGAAAAAGTTTACAACGTTCCTTTTACAGCTAATGCTTATGGAATTTACT^ 

GAACTGGGCTTGAAGGTTCCTGAAACCTGGGATGAATTTGAACAGTTAGTCAAAGATATCGTTGCTAAAGGACAAAC
ACCATTTGGAATTGCAGGTGCAGATGCTTGGACACTCAATGGTTACAATCAATTAGCCT^ 

GAGGAAAAGAAGCAAATCAATACCTTCGTTATTCTCAACCAAATGCCATTAAATTGTCGGATCCGATTATGAAAGAT 
GATATCAAGGTCATGGACATCCTTCGCATCAATGGATCTAAGCAAAAGAACTGGGAAGGTGCTGGCTATACCGATGT
TATCGGAGCCTTCGCACGTGGGGATGTCCTCATGACACCAAATGGGTCTTGGGCGATCACAGCGATTAATGAACAAA 
AACCGAACTTTAAGATTGGGACCTTCATGATTCCAGGAAAAGAAAAAGGACAAAGCTTAACCGTTGGTGCGGGAGAC
TTGGCATGGTCTATCTCAGCCACCACCAAACATCCAAAAGAAGCCAATGCCTTTGTGGAATATATGACCCGTCCAGA
AGTCATGCAAAAATACTACGATGTGGACGGATCTCCAACAGCGATCGAAGGGGTCAAACAAGCAGGAGAAGATTCAC 
CGCTTGCTGGTATGACCGAATATGCCTTTACGGATCGTCACTTGGTCTGGTTGCAACAATACTGGACCAGTGAAGCA 
GACTTCCATACCTTGACCATGAACTATGTCTTGACCGGTGATAAACAAGGCATGGTCAATGATTTGAATGCCTTCTT 
TAACCCGATGAAAGCGGATGTGGATTAG



MEWYKKIGLLATTGLALFGLGACSNYGKSADGTVTIEYFNQKKEMTKTLEEITRDFEKENPKIKVKVVNVPNAGEVL
KTRVLAGDVPDVVNIYPQSIELQEWAKAGVFEDLSNKDYLKRVKNGYAEKYAVNEKVYNVPFTANAYGIYYN
ELGLKVPETWDEFEQLVKDIVAKGQTPFGIAGADAWTI^GYNQLAFATATGGGKEANQYLRYSQPNAIKLSDPIM 
DIKVMDILRINGSKQKNWEGAGYTDVIGAFARGDVLMTPNGSWAITAINEQKPNFKIGTFMIPGKEKGQSLiTVGAGD 
LAWSISATTKHPKEANAFVEYMTRPEVMQKYYDVDGSPTAIEGVKQAGEDSPLAGMTEYAFTDRHLVWLQQYWTSEA
DFHTLTMNYVLTGDKQGMVNDLNAFFNPMKADVDZ

ID207 - 4123.1 

ATGAAGAAAATCAAACCGCATGGACCGTTACCAAGTCAGACTCAGCTAGCTTATCTGGGAGATGAACTAGCAGCTTT
TATCCACTTCGGTCCTAATACCTTTTATGACCAAGAATGGGGGACTGGACAGGAGGATCCTGAGCGCTTTAACCCGA 
GTCAGTTGGATGCGCGTGAGTGGGTTCGTGTGCTCAAGGAAACGGGCTTCAAAAAGTTGATTTTGGTGGTCAAGCAC 
CACGATGGCTTTGTCCTTTATCCGACAGCTCACACAGATTATTCGGTTAAGGTCAGTCCTTGGAGGAGAGGAAAGGG
CGACTTGCTCCTTGAAGTATCCCAAGCTGCCACAGAGTTTGATATGGATATGGGGGTCTACCTGTCACCGTGGGATG 
CCCATAGTCCCCTCTATCATGTGGACCGAGAAGCGGACTACAATGCCTATTATCTGGCTCAGTTGAAGGAAATCTTA 
TC^AATCCTAACTATGGGAATGCTGGTAAGTTCGCTGAGGTTTGGATGGATGGTGCCAGAGGAGAGGGCGCGCAAAA 
GGTTAATTATGAATTTGAAAAATGGTTTGAAACCATTCGTGACCTGCAGGGCGATTGCTTGATTTTTTCAACAGAAG
GCACCAGTATCCGCTGGATTGGCAATGAACGAGGGTATGCAGGTGATCCACTGTGGCAAAAGGTGAATCCTGATAAA 
CTAGGAACAGAAGCAGAGCTGAACTATCTTCAGCACGGGGATCCCTCGGGCACGATTTTTTCAATCGGAGAGGCAGA
TGTTTCCATCCGTCCAGGCTGGTTCTACCATGAGGATCAGGATCCTAAGTCTCTCGAGGAGTTGGTCGAAATCTACT 
TTCACTCAGTAGGGCGAGGAACTCCACTCTTGCTTAATATTCCGCCGAATCAAGCTGGGCTCTTTGATGCAAAGGAT 
ATTGAACGACTTTATGAATTTGCGACCTATCGCAATGAGCTCTATAAAGAAGATTTGGCTCTGGGAGCTGAGGTATC 
TGGTCCAGCTCTTTCCGCAGACTTTGCTTGTCGCCATTTGACAGACGGCCTTGAGACCAGCTCTTGGGCAAGCGATG 
CAGACTTGCCCATCCAGTTAGAACTCGACTTAGGTTCTCCTAAAACTTTTGATGTAATTGAGTTAAGAGAAGATTTG 
AAGCTAGGGCAACGAATCGCTGCTTTTCATGTGCAAGTAGAGGTGGATGGTGTCTGGCAGGAGTTTGGTTCGGGTCA 
TACTGTTGGTTACAAACGTCTCTTACGAGGAGCAGTTGTTGAGGCACAGAAGATACGTGTAGTCATTACAGAATCAC
AGGCTTTGCCTTTGTTGACCAAGATTTCCCTTTATAAAACTCCTGGATTATCAAAAAAAGAAGTTGTTCAGGAACTA 
GCATTTGCAGAAAAAAGCCTAGCTGTGGCAAAGGGAGAAAATGCCTATTTTACAGTTAAGCGCAGAGAATGTAGTGG 
TCCTTTAGAAGCTAAGATTTCGATTCAACCGGGGACAGGTGTCCATGGTGTCGCCTATCAGGATGAGATTCAAGTCC
TTGCGTTTCAAACTGGTGAGACTGAAAAAAGTCTGACGCTACCAACCTTGTATTTCGCAGGAGATAAAACCTTGGAT 
TTCTATCTGAACCTAACGGTGGATGGTCAGCTTGTGGATCAACTTCAAGTCCAAGTTTCATAA 
MKKIKPHGPLPSQTQLAYLGDELAAFIHFGPNTFYDQEWGTGQEDPERFNPSQLDAREWVRVLKETGFKKLILVVKH
HDGFVLYPTAHTDYSVKVSPWRRGKGDLLLEVSQAATEFDMDMGVYLSPWDAHSPLYHVDREADYNAYYLAQLKEIL
SNPNYGNAGKFAEVWMDGARGEGAQKVNYEFEKWFETIRDLQGDCLIFSTEGTSIRWIGNERGYAGDPLWQKVNPDK 
LGTEAELNYLQHGDPSGTIFSIGEADVSIRPGWFYHEDQDPKSLEELVEIYFHSVGRGTPLLLNIPPNQAGLFDAKD
IERLYEFATYRNELYKEDLALGAEVSGPALSADFACRHLTDGLETSSWASDADLPIQLELDLGSPKTFDVIELREDL 
KLGQRIAAFHVQVEVDGVWQEFGSGHTVGYKRLLRGAVVEAQKIRVVITESQALPLLTKISLYKTPGLSKKEVVQEL
AFAEKSLAVAKGENAYFTVKRRECSGPLEAKISIQPGTGVHGVAYQDEIQVLAFQTGETEKSLTLPTLYFAGDKTLD 
FYLNLTVDGQLVDQLQVQVSZ 


ID208 - 4125.12 

ATGCTTGAAAGACTGAAAAGAATACATTATATGTTTTGGATCAGTTTAATTTTTATGATTTTCCCCATCCTGTCTGT 
AGTGACTGGGTGGCITTCTGCCTGGCATTTATTGATTGATATTCTATTTGTAGTGGCATATTTGGGTGTTTTAACAA 
CTAAGAGCCAGCGCCTATCTTGGCTATATTGGGGCCTCATGCTGACTTATGTAGTTGGGAATACTGCCTTTGTTGCT 

GTTAATTATATCTGGTTTTTCTTTTTCCTATCCAATCTCTTAAGTTATCATTTCAGCGTACGTAGTTTAAAGTCTTT

ACATGTCTGGACTTTTCTTCTTGCTCAAGTCCTTGTTGTGGGGCAACTGTTGATTTTTCAGAGAATCGAAGTTGAGT 
TTCTATTCTATCTACTTGTAATTCTTACTTTTGTCGATTTAATGACTTTTGGATTGGTTCGGATTCGTATTGTCGAG 
GATTTGAAAGAAGCTCAGGTCAAG C AAAATGCTC AG ATAAATCT ATTG CTTGCTGAAAATGAACGT AGTCGT ATCGG 
TCAGGATTTGCATGATAGTCTGGGACATACCTTTGCTATGCTGAGTGTCAAGACAGATTTAGCCTTGCAGTTATTTC 

AGATGGAGGCTTATCCACAGGTGGAAAAGGAATTAAAAGAAATTCACCAGATCAGCAAGGATCCATGA



MLERLKR I H YM FWISLIFMIFPI LS WTGWLS AWH L L I D I L FW A Y LGVLTT KS Q RL S W L YWG LMLT YWGNT A FVA 
VNYIWFFFFLSNLLSYHFSVRSLKSLHVWTFLLAQVLWGQLLIFQRIEVEFLFYLLVILTFVDLMTFGLVRIRIVE 
DLKEAQVKQNAQINLLLAENERSRIGQDLHDSLGHTFAMLSVKTDLALQLFQMEAYPQVEKELKEIHQISKDPZ

ID209 - 4126.3 

ATGAATGATAAGTTAAAAATCTTCTTGTTGCTAGGAGTATTTTTTCTAGCCATAACCGGTTTCTATGTTCTATTGAT 
ACGAAATGCAGGGCAGACAGATGCCTCGCAAATTGAAAAGGCGGCAGTTAGCCAAGGAGGAAAAGCAGTGAAAAAAA 

CAGAAATTAGTAAAGACGCAGACTTGCACGAAATTTATCTAGCTGGAGGTTGTTTCTGGGGAGTGGAGGAATATTTC
TCACGTGTT CCCGGGGTG ACGG ATGCCGTTTC AGG CT ATGCAAATGGTAG AGG AGAAAC AAC CAAGT ACGAATTG AT 
TAACCAAACAGGTCATGCAGAAACCGTCCATGTCACCTATGATGCCAAGCAAATTTCTCTCAAGGAAATCCTGCTTC 
ACTATTTCCGCATTATCAATCCAACCAGCAAAAATAAACAAGGAAATGATGTGGGGACCCAGTACCGTACTGGTGTT 
T ATTAC AC AG ATGACAAGGATTTGGAAGTG ATT AACCAAGT CTTTGATG AGGTGGCT AAG AAAT ACGATC AAC CTCT 

AGCAGTTGAAAAGGAAAACTTGAAGAATTTTGTGGTGGCTGAGGATTACCATCAAGACTATCTCAAGAAAAATCCAA
ATGG CT ACTGC CATATC AATGTTAATCAGGCGGCCT ATCCTGTC ATTGATGC CAGCAAAT ATCC AAAACC AAGTGAT 
GAGGAATTGAAAAAGACCCTGTCACCTGAGGAGTATGCAGTTACCCAGGAAAATCAAACAGAACGAGCTTTCTCAAA 
CCGT T ACTGGGATAAATTTGAATCCGGTATCT ATGTGGAT AT AG CAACTGGGG AACCTCTCTTTTCATCAAAAGACA 
AATTTGAGTCTGGTTGTGGCTGGCCTAGTTTTACCCAACCCATCAGTCCAGATGTTGTCACCTACAAGGAAGATAAG 

TCCTACAATATGACGCGTATGGAAGTGCGGAGCCGAGTAGGAGATTCTCACCTTGGGCATGTCTTTACGGATGGTCC

ACAGG AC AAGGG CGG CT TACGT TACTGTATCAATAGC CTCTCT AT CCGCTTTATTCCCAAAG ACCAAATGGAAGAAA 
AAGg CT ACG CTTATTTACT AG ATT ATGTTGATT AA 

MNDKLKIFLLLGVFFLAITGFYVLLIRNAGQTDASQIEKAAVSQGGKAVKKTEISKDADLHEIYLAGGCFWGVEEYF

SRVPGVTDAVSGYANGRGETTKYELINQTGHAETVHVTYDAKQISLKEILLHYFRIINPTSKNKQGNDVGTQYRTGV 
YYTDDKDLEVINQVFDEVAKKYDQPIAVEKENLKNFVVAEDYHQDYLKKNPNGYQiI>A/NQAAYPVIDASKYPKPSD 
EELKKTLSPEEYAVTQENQTERAFSNRYWDKFESGIYVDIATGEPLFSSKDKFESGCGWPSFTQPISPDWTYKEDK 
SYNMTRMEVRSRVGDSHLGHVFTDGPQDKGGLRYCINSLSIRFIPKDQMEEKGYAYLLDYVDZ


ID210 - 4127.1 

ATGAAAAAGAAATGGATGTATTATGCTGCTTGTTCTTCTAATGAATCTGCCGATGACAGTTCATCTGATAAAGGAGA 
CGGCGGTTCGCTAGTCGTTTATTCACCAAACTCAGAGGGCTTAATTGGAGCAACTATTCCTGCCTTTGAAGAAAAAT 
ATGGTATC AAAGTAGAACTGATTCAAG CTGGT ACTGG AGAACTTTTC AAAAAACT AGAGT CAGAAAAAGAAGTTCCT 

GTAGCTGATGTTATCTTTGGTGGTTCTTATACACAATATACTACCCACGGAGAACTCTTTGAAAACTATACTTCAAA
AG AAAATGAT AATGTT AT CAAAG AATAT C AAAACACAACTGGCT ACTCT ACT C CTT AT ACACT AGATGGT AGTGTTT 
TAATCGTCAACC CTGATTT AACT AAAGG C ATG AAC AT CG AAGGAT AT AACGAT CTTTT CAAAC CTG AACT AAAAGGA 
AAAATCGCAACrGCTGACCCAGCAAACTCTTCTAGCGCCTTTGCTC AATT AAC AAAT ATGCTACAAGCTCAAGGTGG 
TTACAAAGATGATAAGGCTTGGTCTTATGTAAAAGATCTTTTCACACTTATTGATGGTAAAATCGGTTCAAGTTCAT 

CTAGTGTCTATAAAGTAGTCGCTGATGGAGAAATGGCTGTTGGTCTCTCTTATGAAGATCCAGCAGTTAAACTCTTA
AATGACGGAGCTAACATTAAGGTAGTCTATCCAAAAGAAGGAACCGTCTTCCTACCTGCTAGTGCTGCTATCGTTAA
AAAAT CTAAAAAT ATGGAAAATGC CAAGAAATTT ATCGATTT T ATT ATCTCTCAAG AAGTACAAG AT ACACTTGGT A 
CAACCACTACTAACCGTCCTGTTCGTAAAAATGCTAAAACAAGCGAAAACATGAAACCAATTGACAAAATCAAAACA 
CT CACTGAAG ATT ATG ATT ATGTCATC AAG AAT AAATC AG AT AT CGTT AAGAAAT ACAACG AAG TCTTTACAG ATAT 
CCAATCTAAACAGTAA 



MKKKWMYYAACSSNESADDSSSDKGDGGSLWYSPNSEGLIGATIPAFEEKYGIKVELIQAGTGELFKKLESEKEVP 
VAD V I FGGS YTQ YTTHG E L FEN YTS KE NDNV I KE YQNTTG YST PYTLDGSVLI VN P D LT KGMN I EG YND L F K P E L KG 
KIATADPANSSSAFAQLTNMLQAQGGYKDDKAWSYVKDLFTLIDGKIGSSSSSVYKWADGEMAVGLSYEDPAVKLL 
NDGANI KWYPKEGTVFLPASAAIVKKSKNMENAKKFIDFI ISQEVQDTLGTTTTNRPVRKNAKTSENMKPIDKIKT 
LTEDYDYVI KNKSDIVKKYNEVFTDIQSKQZ 

ID211 - 4127.2 

ATGAGTGAGATCAAAATTATTAACGCCAAAAAAATCTACCACGATGTCCCTGTTATTGAGAATTTGAACATTACAAT 
TCCAAAAGGAAGTCTCTTTACCCTTCTTGGAGCTTCAGGATGTGGGAAAACGACCCTTCTTCGTATGATTGCAGGTT 
T CAAC AGT ATCGAAGGTGG AGAATTTT ACTTCGATG ATAC AAAAATC AAT AAT ATGGAACCC AG C AAACG CAATATC 
GGGATGGTTTTCCAAAACTACGCTATTTTCCCACATTTGACTGTCCGAGACAACGTTGCTTTTGGTCTTATGCAAAA 
GAAGGTTCCAAAAGAAGAATTGATT CAACAGAC CAACAAGT ATCTTG AACTCATGC AAAT TG CTCAATATGCGGATC 
GAAAGCCCGATAAACTCAGTGGTGGACAACAACAACGTGTCACCTTGGCATGCGCCTTAGCGGTTAATCCAAGTGTT 
CTCCTCATGGACGAGCCACTTAGTAATCTGGAGGCCAAACTTCGCTTGGATATGCGTCAAGCCATCCGAGAAATCCA 
ACACGAAGTGGGAATTACAACTGTTTATGTAACCCACGACCAAGAAGAAGCCATGGCTATTTCAGACCAAATTGCTG 
TTATGAAAGATGGGGTGATCCAAGAAATCGGCCGACCAAAAGAACTCTATCATAAACCAGCTAATGAGTTTGTGGCA 
ACCTTTATCGGACGCACAAATATTATCCCTGCCAATCTTGAAAAACGGAGCGACGGCGCTTATATCGTCTTTTCAGA 
TGGCTATGCCCTTCGAATGCCAGCTCTTGATCAGGTTGAGGAGCAAGCTATTCATGTAAGCATTCGTCCCGAAGAGT 
TTATCAAAGATGAATCTGGAGATATTGAAGGAACTATTAGAGATAGCGTCTATCTTGGACTAAATACGGATTATTTC 
ATTG AGAC AGGTTTTGCCTCAAAAATTCAAGTT AGTG AAGAAT CAACTTTTGAAG AAGATCTACAAAAAGG CAATCG 
TATTCGTCTACGAATCAATACGCAAAAATTAAACATCTTTTCTGCAGATGGTTCCCAAAACCTGATAAAAGGAGTCA 
ACCATGGAACGTAA 

MSEIKIINAKKIYHDVPVIENLNITIPKGSLFTLLGASGCGKTTLLRMIAGFNSIEGGEFYFDDTKINNMEPSKRNI

GMVFQNYAIFPHLTVRDOTAFGLMQKKVPKEELIQQTNKYLELM^ 

LI^EPLSNLEAKUUjD^QAIREIQHEVGITWWTHDQEE 

TFIGRTNIIPANLEKRSDGAYIVFSDGYALRMPALDQVEEQAIHVSIRPEEFIKDESGDIEGTIRDSVYLGLNTDYF 
IETGFASKIQVSEESTFEEDLQKGNRIRLRINTQKLNIFSADGSQNLIKGVNHGTZ 

ID212 - 4136.1 

ATGAAG AAAAAATT ATTGGC AGGTGCCAT C ACACTATTATCAGTAGC AACTTTAG CAG CTTGTTCGAAAGGGTCAG A 
AGGTG CAGACCTT ATCAG CATGAAAGGGG ATGTCATT AC AGAAC ATCAATTTT ATG AGCAAG TG AAAAGCAACCCTT 
CAGCCCAACAAGTCTTGTTAAATATGACCATCCAAAAAGTTTTTGAAAAACAATATGGCTCAGAGCTTGATGATAAA 
GAGGTTGATGATACTATTGCCGAAGAAAAAAAACAATATGGCGAAAACTACCAACGTGTCTTGTCACAAGCAGGTAT 
GACTCTTGAAACACGTAAAG CT CAAATTCGTACAAGT AAATT AGTTG AGTTGGCAGTTAAGAAGGTAGCAGAAGCTG 
AATTG ACAGATGAAG CCT ATAAGAAAGCCTTTGATGAGT ACACT CCAG ATGTAACGGCTCAAATCATCCGTCTTAAT 
AATGAAG ATAAGG C CAAAGAAGTTCTCGAAAAAGCCAAGG C AG AAGGTG CTGATTTTGCTCAATTAGCCAAAGATAA 
TTCAACTG ATGAAAAAACAAAAGAAAATGGTGG AGAAAT TAC CTTTGATTCTGCTT CAACAGAAGTACCTGAGC AAG 
TCAAAAAAGCCG CTTTCG CTTT AGATGTGGATGGTGTTTCTG ATGTGATTACAGCAACTGGC ACACAAGCCTACAGT 
AGCC AAT ATT ACATTGT AAAACT CACT AAG AAAACAG AAAAAT CATCT AATATTG ATG ACTACAAAG AAAAATT AAA 
AACTGTTATCTTGACTCAAAAACAAAATGATTCAAC ATTTGTT CAAAGCATT ATCGGAAAAG AATTGC AAG CAGCCA 
ATATC AAGGTT AAGG ACCAAGCCTTC CAAAAT ATCTT T AC CCAATATATCGGTGGTGGAGATTCAAG C T CAAG CAGT 
AGT ACAT CAAACG AAT AG 

MKKKLLAGAITLLSVATIJ^CSKGSEGADLISMKGDVITEHQFYEQVKSNPSAQQVLL^MTIQKVFEKQYGSELDDK 
EVDDTIAEEKKQYGE^YQRVLSQAGMTLETRKAQIRTSKLVELAVKKVAEAELTDEAYKKAFDEYTPDVTAQIIRLN 
NEDKAKEVLEKAKAEGADFAQLAKDNSTDEKTKENGGEITFDSASTEVPEQVKKAAFALDVDGVSDVITATGTQAYS 
SQYYIVKLTKKTEKSSNIDDYKEKLKTVILTQKQNDSTFVQSIIGKELQAANIKVKDQAFQNIFTQYIGGGDSSSSS 
STSNEZ 

ID213 - 4137.3

ATGAAAAAAAATATTAAACAATATGTAACCTTAGGTACTGTAGTGGTATTATCAGCATTTGTTGCTAACTCAGTTGC 
AGCTCAGGAGACTGAAACTTCTGAAGTATCAACACCAAAGTTGGTGCAACCTGTTGCACCAACGACTCCGATTTCGG 
AAGT ACAACCT AC AT CGGAT AACTCTTCGGAAGTT ACTGTAC AACCT CGAAC AGTTGAAACTACTGTTAAGGATCCA 
TCTTCTACAGCGGAAGAAACTCCTGTCTTAGAAAAAAATAATGTTACTTTAACAGGGGGCGGAGAAAATGTTACTAA
AGAGTTAAAGGATAAATTTACTAGCGGTGACTTTACTGTAGTGATTAAGTACAATCAGTCAAGTGAGAAAGGCTTAC 
AAG CTCTGTTTGG AAT AT CTAATTCCAAAC CCGGTCAACAAAATAGTTATGT AGATGTGTTCCTTAG AG ACAATGGT 
GAGTTGGGGATGGAAGCGCGTGATACTTCTTCCAATAAAAATAACCTAGTATCCAGACCTGCTTCAGTTTGGGGTAA 
GT AC AAAC AAGAGG C TGTG ACT AACACTGTTG C AGTAGT AGC AGATTCAGT CAAAAAAACATATTCTTT AT ACGCAA 

ATGGTACAAAAGTAGTAGAAAAGAAAGTGGATAATTTCCTAAACATCAAGGATATTAAAGGTATTGATTACTATATG
CTTGGGGGAGTGAAACGTGCAGGAAAAACGGCGTTTGGTTTTAACGGAACACTAGAAAATATCAAATTCTTTAATAG 
TG CATTGGATGAAGAAACTGTTAAAAAGATGAC AACAAACG CTGTTACTGG ACATTT AATTTATACGGCTAATGATA 
CAACAGGTTCTAACTATTTCCGTATTCCAGTTCTGTATACTTTTAGCAATGGTCGGGTATTTTQWVGCATTGACGCT 
CGTTACGGTGGAACTCATGATTTCTTGAATAAAATTAATATTGCTACAAGTTATAGTGATGATAATGGTAAGACATG 

GACTAAACCAAAATTAACATTGGCATTCGATGATTTTGCGCCAGTACCATTAGAATGGCCTCGTGAAGTTGGTGGAC
GTG ACTT AC AAATCAG CGGTGGTGCAAC CT ATATTG ACTCTGTT ATTGTTGAAAAAAAGAACAAAC AAGT ACTCATG 
TTTGCTGATGTGATGCCTGCTGGAGTAAGTTTTAGAGAAGCAACTAGAAAAGATTCAGGTTATAAACAAATTGATGG 
TAATTATTACCTTAAATTAAGGAAACAAGGTGATACTGATTACAATTATACTATTCGTGAGAATGGTACTGTATACG 
ACGATCGTACCAACAGACCAACTGAATTTTCAGTAGATAAAAATTTCGGTATTAAACAAAATGGTAATTATTTGACG 

GTAGAGCGG

MKKNIKQYVTLGTVWLSAFVANSVAAQETETSEVSTPKLVQPVAPTTPISEVQPTSDNSSEVTVQPRTVETTVKDP 
SSTAEETPVLEKNNVTLTGGGENVTKELKDKFTSGDFTWIKYNQSSEKGLQALFGISNSKPGQQNSYVDVFLRDNG 
E LGMEARDTSSNKNNLVSRP ASVWG KY KQEAVTNTVAWADS VKKTY SLYANGTKWEKKVDNFLN I KD IKG I DY YM 
LGGVKRAGKTAFGFNGTLENIKFFNSALDEETVKKMTTNAVTGHLIYTANDTTGSNYFRIPVLYTFSNGRVFSSIDA
RYGGTHDFLNKINIATSYSDDNGKTWTKPKLTIiAFDDFAPVPLEWPREVGGRDLQISGGATYIDSVIVEKKNKQVLM 
FADVMPAGVS FREATRKDSGYKQI DGNYYLKLRKQGDTDYNYTIRENGTVYDDRTNRPTEFSVDKNFG I KQNGNYLT 
VER 

ID214 - 4185

ATG AAAAAATTT AG C CTATTACT AG CTAT C CTACCATTTTTGGTTGCCTGTG AGAATCAAGCTACACCC AAAGAGAC 
TAGCGCTCAAAAGACAATCGTCCTTGCTACAGCTGGCGACGTGCCACCATTTGACTACGAAGACAAGGGCAATCTGA 
CAGGCTTTGAT ATCGAAGTTTT AAAGGCAGT AGATGAAAAACT CAGCGACT ACGAGATTCAATTCCAAAG AACCG CC 
TGGGAGAGCATCTTCCCAGGACTTGATTCTGGTCACTATCAGGCTGCGGCCAATAACTTGAGTTACACAAAAGAGCG 

TGCTGAAAAATACCTTTACTCGCTTCCAATTTCCAACAATCCCCTCGTCCTTGTCA

CTTCT CTTGACCAG ATCGCTGGT AAAAC AACACAAG AGG AT ACCGGAACTTCT AACG CTCAATT CATCAAT AACTGG 
AAT CAGAAACACACTG AT AAT CCCG CT ACAATT AATTTTT CTGGTGAGG AT AT TGGTAAACGAATCCTAG ACCTTGC 
TAACGGAGAGTTTGATTTCCTAGTTTTTGACAAGGTATCCGTTCAAAAGATTATCAAGGACCGTGGTTTAGACCTCT 
C AG T CGTTGATTTAC CTTCTGC AG AT AGC CC C AG CAATT AT ATCATTTT CTCAAG CG ACC AAAAAGAGTTT AAAGAG 

CAATTTGATAAAGCGCTCAAAGAACTCTATCAAGACGGAACCCTTGAAAAACTCAGCAATACCTATCTAGGTGGTTC
T T ACCT CC CAG AT C AATCTC AGTTAC AAT AA 



MKKFSLLLAILPFLVACENQATPKETSAQKTIVLATAGDVPPFDYEDKGNLTGFDIEVLKAVDEKLSDYEIQFQRTA 
WESIFPGLDSGHYQAAANNLSYTKERAEKYLYSLPISNNPLVLVSNKKNPLTSLDQIAGKTTQEDTGTSNAQFINNW
NQKHTDNPATINFSGEDIGKRILDLANGEFDFLVFDKVSVQKIIKDRGLDLSWDLPSADSPSNYIIFSSDQKEFKE 
QFDKALKELYQDGTLEKLSNTYLGGSYLPDQSQLQZ

ID215 - 42U>J,

ATGAAAAAAAATAGTTTATATATCATATCCTCACTCTTTTTTGCTTGTGTCTTATTTGTCTATGCTACGGCGACGAA
TTTTCAAAACAGT AC C AGTG CT AGG C AGGTAAAAACGG AAACCTATACT AAT ACAGTAACAAATGTCCCTATTGACA 
TACGCTATAATAGTGATAAGTATTTTATTAGCGGTTTTGCTTCAGAAGTATCAGTGGTCTTGACTGGTGCAAATCGC 
CTATCGCTAGCT AGTG AAATG C AAG AAAGTACACGTAAATT CAAGGTTACTGCTGACCTAACAGATGC CGGTGTTGG 
AACGATTGAAGTT CCTTTGAG CATTGAAG ATTT ACCCAATGGG CTGACCGCTGTGGCG ACTCCGCAAAAAATTACAG 

TCAAGATTGGTAAGAAGGCTCAGAAGGATAAGGTAAAGATTGTACCAGAGATTGACCCTAGTCAAATTGATAGTCGG

GTACAAATTGAAAATGTCATGGTGTCAGATAAAGAAGTGTCTATTACGAGTGACCAAGAGACATTGGATAGAATTGA 
T AAG ATTATCG CTGTTTTGCC AACT AGCG AACGT AT AAC AGGT AATTACAGTGGTTCAGT ACCTTTGC AGG CAAT CG 
ACCGCAATGGTGTTGTCTTACCGGCAGTTATCACTCCGTTTGATACAATAATGAAGGTGACTACAAAACCAGTAGCA 
CCAAGTTCAAG C AC ATC AAAT TCAAGT AC AAGC AGTT C ATCGG AG AC AT CTTCGTC AACGAAAGCAACT AGTT C AAA 

AACGAATTAA

MKKNSLYIISSLFFACVLFVYATATNFQNSTSARQVKTETYTNTVTNVPIDIRYNSDKYFISGFASEVSVVLTGANR
LSLASEMQESTRKFKVTADLTDAGVGTIEVPLSIEDLPNGLTAVATPQKITVKIGKKAQKDKVKIVPEIDPSQIDSR 
VQIENVMVSDKEVSITSDQETLDRIDKIIAVLPTSERITGNYSGSVPLQAIDRNGWLPAVITPFDTIMKVTTKPVA 
PSSSTSNSSTSSSSETSSSTKATSSKTNZ

ID216 - 4127.3 

ATGTTGATTGGCGAAGGGTATCGGACTTTCCCTGTCCTGATTTATACCCAATTTATTAGCGAGGTTGGAGGAAATTC 
TGCTTTTG(^TTATGGCGATTATCATTGCCTTGGCAATTTTCCTTATCCAAAAACACATTGCAAACCGCTACAGTT 

TCAGCATGAATCTGCTCCATCCAATTGAGCCTAAAAAAACTACAAAAGGAAAAATGGCTGCCATTTATGCAACAGTC
T ACGGAATTAT CTTTATCT CTGTTTTACCTCAAATCT ACTTAATTTATAC CTCTTTCCTAAAAAC ATCAGGTATGGT 
AT CTGTTAAAGGTTATTCTC CAAACAGTTACT^AGGT AG CTTTCCATCGT ATGGGATCTGCTATTTTCAATACCATTC 
GTATCCCTTTGATTGCCTTAGTTCTAGTTGTTCTATTTGCGACATTTATCTCCTACCTAGCCGTTAGAAAACGGAAT 
TTGTTTACAAACTTAATTGACAGCCTCAGTATGGTACCTTATATTGTACCAGGAACCGTTCTAGGGATTGCCTTCAT 

TTCTTCCTTCAATACTGGTCTATTTGGAAGTGGATTTCTTATGATTACAGGGACTGCTTTCATCTTGATTATGTCTC
TATCTG CCAGAAGATTACCT T AT ACTATT CGCTCATCTGTTGCT AGCTTACAACAAAT AGCACCAAGT ATTGAAGAA 
G CTGCTGAAAG CTTAGGAAGT AGT CGT CT CAAT ACCTT TGCT AAGATTACAACT CCAATGATGCT ATCTGGTATCAT 
TTCTGGAGCCATCTTATCTTGA 

MLIGEGYRTFPVLIYTQFISEVGGNSAFAI^IIIAIAIFLIQKHIAbmYSFSMNLLHPIEPKKTTKGKMAAIYATV 
YGIIFISVLPQIYLIYTSFLKTSGMVSVKGYSPNSYKVAFHRMGSAIFNTIRIPLIALVLWLFATFISYLAVRKRN 
LFTNLIDSLSMVPYIVPGTVLGIAFISSFNTGLFGSGFLMITGTAFXLIMSLSARRLPYTIRSSVASLQQIAPSIEE 
AAESLGSSRLNTFAKITTPMMLSGIISGAILSZ

Table 4
ID301 

ATGAATAAGAAAAAAATG ATTTTAACAAGTCTAG CC AG CGT CGCT ATCTT AGGGGCTGGTTTTGTTACGTCTCAGCC 
TACTTTTGT AAGAGCAG AAGAATCTCCAC AAGT TGT CG AAAAATCTTCATT AG AG AAG AAATATGAGGAAGCAAAAG 
CAAAAGCTG ATACTGCCAAGAAAG ATTACGAAACGGC T AAAAAGAAAG CAGAAG ACGCTCAG AAAAAGT ATGAAG AT 
GATCAGAAG AGAACTGAGGAGAAAGCTCGAAAAGAAGCAG AAG CAT CT CAAAAATTGAATGATGTGGCGCTTGTTGT 
TCAAAATGCATATAAAGAGTACCGAGAAGTTCAAAATCAACGTAGTAAATATAAATCTGACGCTGAATATCAGAAAA 
AATTAACAGAGGTCGACTCTAAAATAGAGAAGGCTAGGAAAGAGCAACAGGACTTGCAAAATAAATTTAATGAAGTA 
AGAGCAGTTGTAGTTCCTGAACCAAATGCGTTGGCTGAGACTAAGAAAAAAGCAGAAGAAGCTAAAGCAGAAGAAAA 
AGTAGCTAAGAGAAAATATGATTATGCAACTCTAAAGGTAGCACTAGCGAAGAAAGAAGTAGAGGCTAAGGAACTTG 
AAATTGAAAAACTTCAATATGAAATTTCTACTTTGGAACAAGAAGTTGCTACT^ 

AAAAAACTTCTTGCTGGTGCGGATCCTGATGATGGCACAGAAGTTATAGAAGCTAAATTAAAAAAAGGAGAAGCTGA 

GCTAAACGCTAAACAAGCTGAGTTAGCAAAAAAACAAACAGAACTTGAAAAACrrTCTTGAC^ 

GTAAG ACT CAGGATGAATTAGAT AAAGAAGCAG AAGAAG CTGAGTTGGATAAAAAAGCTGATGAACTTCAAAATAAA 

GTTGCTGATTTAGAAAAAGAAATTAGTAACCTTGAAATATTACTTGGAGGGGCT 

TCTTCAAAATAAATTAGCTGCTAAAAAAGCTGAGTTAGCAAAAAAACAAACAGAACTTGAAAAACTTCT 

TTCATCCTGAAGGTAAGACTCAGGATGAATTAGATAAAGAAGCAGAAGAAGCTGAGTTGGATAAAAAAGCTGATGAA 

CTTCAAAATAAAGTTGCTGATTTAGAAAAAGAAATTAGTAACCTTGAAATATTACTTGGA 

TGATACTGCTGCTCTTCAAAATAAATTAGCTACTAAAAAAGCTGAATTGGAAAAAACTCAAAAAGAATTAGATC 

CTCTTAATGAGTTAGGCCCTGATGGAGATGAAGAAGAAACTCCAGCGCCGGCTCCTCAACCAGAGCAACCAGCTCCT 

G CAC C AAAAC C AG AGC AAC C AG CT C C AG CT C C AAAAC C AGAGCAAC C AGCTC CTGCACC AAAAC C AG AG C AA C CAG C 

TCCAGCTCCAAAACCAGAGCAACC^GCTCCAGCTCCAAAACCAGAGCAACCAGCTAAGCCGGAGAAACCAGCTGAAG 

AGCCTACTCAACCAGAAAAACCAGCCACTCCAAAAACAGGCTGGAAACAAGAAAACGGTATGTGGTATTTCTACAAT 

ACTGATGGTTCAATGGCAATAGGTTGGCTCCAAAACAACGGTTCATGGTACTACCTAAACGCTAACGGCGCTATGGC 

AACAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGTTCA 

AAGTAT CAGATAAATGGTACTATGTCAACAGCAATGG CGCT ATGGCGAC AGG CTGGCTC CAAT ACAATGGCTC ATGG 

TACTACCTCAACGCTAATGGTGATATGGCGACAGGATGGCTCCAATACAACGGTTCATGGTATTACCTCAACGCTAA 

TGGTG ATATGGCGACAGG ATGGGCTAAAGTCAACGGTTCATGGT ACTACCT AAACGCTAACGGTG CTATGG CTACAG 

GTTGGGCTAAAGTCAACGGTTCATGGTACTACCTAAACGCTAACGGTTCAATGGCAACAGGTTGGGTGAAAGATGGA 

GATACCTGGTACTATCHTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGTTCAAAGTATCAGATAAATGGTACTA 

TGTCAATGGCTTAGGTGCCCTTGCAGTCAACACAACTGTAGATGGCTATAAAGTCAATGCCAATGGTGAATGGGTTT 

AA 

MNKKKMILTSI^VAII^AGFVTSQPTFVRAEESPQWEKSSLEKKYEEAKAKADTAKKD 

DQKRTEEKARKEAEASQKLtTOVAljVVQNAYKE YREVQNQRSKYKSDAE YQKKLTEVDS KI EKARKEQQDLQNKFNEV 

RAVWPEPNALAETKKKAEEAKAEEKVAKRKYDYATLKVALAKKEVEAKELEIEKLQYErSTL^ 

KKLLAGADPDDGTEVIEAKLKKGEAELNAKQAELAKKQTELEKLLDSLDPEGK^ 

VADLEKEISNLEILIXjGADPEDDTAALQNKLAAKKAJEXiAKKQTELEKLLDSLDPEGKTQDELDKE 

LQNKVADLEKEISNLEXLLGGADSEDDTAALQNKLATKKAELEKTQKELDAALNELGPDGDEEETPAPAPQPEQPAP 

APKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAKPEKPAEEPTQPEKPATPKTGWKQENGMWYFYN 

TIXSS^IGWLQNNGSWYYLNANGAMATGWKDGDTWYYLEASGAMKASQWFKVSDKW 

YYI^ANGDMATGWLQYNGSWYYLNANGDMATGWAKVNGSWYYLNANGAMATGW 

DTVfYYLEASGAMKASQWFKVSDKWYYVNGLGAIiAVNTTVDGYKVNANGEWV^ 

ID302

ATGTTTGCATCAAAAAGCGAAAGAAAAGTACATTATTCAATTCGTAAATTTAGTGTTG 
TGCC^GTCTTGTTATGGGAAGTGTGGTTCATGCGACAGAGAACGAGGGAGCTACCCAAGTACCC^ 
GGGCAAATGAAAGTCAGGCAGAACAAGGAGAACAACCTAAAAAACTCGATTCAGAACGAGATAAGGCAAGGAAAGAG 
GTCGAGG AAT ATGT AAAAAAAATAGTGGGTGAG AGCT ATGCAAAATCAACTAAAAAG CG ACATACAATTACTGTAG C 
TCTAGTTAACGAGTTGAAC^ACATTAAGAACGAGTATTTGAATAAAATAGTTGAATCAACCTCAGAAAGCCAACTAC 
AGATACK3ATGATGGAGAGTCGATCAAAAGTAGATGAAGCTGTGTCTAAGTTTGAAAAGGACTCATCTTCTTCGTCA 
AGTTCAGACTCTTCCACTAAACCGGAAGCTTCAGATACAGCGAAGCCAAACAAGCCGACAGAACCAGGAGAAAAGGT 
AGCAGAAGCTAAGAAGAAGGTTGAAGAAGCTGAGAAAAAAGCCAAGGATCAAAAAGAAGAAGATCGTCGTAACTACC 
C AACCATTACTTACAAAACG CTTG AACTTGAAATTGCTGAGT CCGATGTGG AAGTTAAAAAAG CGG AGCT TG AACTA 
GTAAAAGTGAAAGCTAACGAACCTCGAGACGAGCAAAAAATTAAGCAAGCAGAAGCGGAAGTTGAGAGTAAACAAGC 
TGAGGCT AC AAGGTT AAAAAAAATCAAGACAG ATCGTG AAG AAGCAG AAGAAGAAG CT AAACG AAG AGC AG ATG CTA 
AAG AGC AAGGT AAACCAAAGGGGCGGG C AAAACG AGG AGTTC CTGG AG AG CTAGC AAC ACCTGAT AAAAAAG AAAAT 
GATGCG AAGTCTTC AGATTCT AG CGT AGGTGAAG AAAC TCTT CCAAGCCCATCCCTGAAAC C AG AAAAAAAGGTAGC 
AGAAG CTG AG AAGAAGGTTG AAG AAG C T AAGAAAAAAG CCG AGGATCAAAAAGAAGAAGAT CGCCGT AACTAC CC AA 
CCAATACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTGGAAGTTAAAAAAGCGGAGCTTGAACTAGTA
AAAGAGG AAG CTAAGG AACCTCG AAACG AGG AAAAAGTTAAGC AAG C AAAAGCG G AAGTTGAGAGT AAAAAAGCTGA 
GGCTACAAGGTTAGAAAAAATCAAGACAGATCGTAAAAAAGCAGAAGAAGAAGCTAAACGAAAAGCAGCAGAAGAAG 
ATAAAGTTAAAGAAAAACCAGCTGAACAACC7VCAACCAGCGCCGGCTCCAAAAGCAGAAAAACCAGCTCCAGCTCCA 
AAACCAGAGAATCCAGCTGAACAACCAAAAGCAGAAAAACCAGCTGATCAACAAGCTGAAGAAGACTATGCTCGTAG
ATCAGAAGAAGAATATAATCGCTTGACTCAACAGCAACCGCCAAAAACTGAAAAACCAGCACAACCATCTACTCC^ 
AAACAGGCTGGAAACAAGAAAACGGTATGTGGTACTTCTACAATACTGATGGTTCAATGGCGACAGGATGGCTCCAA 
AACAATGGCT CATGGTACT AC CTCAACAGC AATGGCG CTATGGCG ACAGGATGG CT CCAAAACAATGGTTC ATGGT A 
CTATCTAAACGCTAATGGTTCAATGGCAACAGGATGGCTCCAAAACAATGGTTCATGGTACTACCTAAACGCTAATG 
GTTCAATGGCGACAGGATGGCTCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGGTTCAATGGCGACAGGA
TGGCTCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGGTGATATGGCGACAGGTTGGGTGAAAGATGGAGA 
TACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGTTCAAAGTATCAGATAAATGGTACTATG 
TCAATGGCTCAGGTGCCCTTGCAGTCAACACAACTGTAGATGGCTATGGAGTCAATGCCAATGGTGAATGGGTAAAC 
TAA 

MFASKSERKVHYSIRKFSVGVASVWASLVMGSVVHATENEGATQVPT 
VEEWKKIVGESYAKSTKKRHTITVALVNELNNIKNEYI^^ 

SSDSSTKPEASDTAKPNKPTEPGEKVAEAKKKVEEAEKKAKDQKEEDRRNYPTITYKTLELEIAESDVEVKKAELEL 
VKVKANEPRDEQKI KQAEAEVES KQAEATRLKKI KTDREEAEEEAKRRADAKEQGKPKGRAKRGVPGELATPDKKEN 
DAKSSDSSVGEETLPSPSLKPEKKVAEAEKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAESDVEVKKA^

KEEAKEPRNEEKVKQAKAEVESKKAEATRLEKIKTDRKKAEEEAKRKAAEEDKVKEKPAEQPQPAPAPKAEKPAPAP 
KPENPAEQPKAEKPADQQAEEDYARRSEEEYNRLTQQQPPKTEKPAQPSTPKTGWKQENGMWYFYNTDGSMATGWLQ 
NNGSWYYLNSNGAMATGWLQNNGSVryYLNANGSMATGWLQNNGSWYYLNANGSMATGWLQYNGSW 
WLQYNGSWYYL^ANGDMATGWKDGDTWYYLEASGAMKASQWFKVSDKWYYTO 

Z

ID303

ATGGTAAAAAGACGTATAAGGAGAGGGACGAGAGAACCTGAAAAAGTTGTTGTTCCTGAGCAATCATCTATTCCTTC 
GTATCCTGT ATCTGTT ACATCTAACCAAGG AAC AGATGT AG CAGTAGAACCAG CT AAAGCAGTTG CTCCAACAACAG 
ACTGGAAACAAGAAAATGGTATGTGGTATTTTTATAATACTGATGGTTCCATGGCAACAGGTTGGGTACAAGTTAAT

AGTTCATGGTACTACCTCAACAGCAACGGTTCTATGAAAGTCAATCAATGGTTCCAAGTTGGTGGTAAATGGTATTA 
TGTAAATACATCGGGTGAGTTAGCGGTCAATACAAGTATAGATGGCTATAGAGTCAATGATAATGGTGAATGGGTGC 
GTTAA 

MVKRRIRRGTREPEKVVVPEQSSIPSYPVSVTSNQGTDVAVEPAKAVAPTTDWKQENGMWYFYNTDGSMATGWVQVN

SSWYYLNSNGSMKVNQWFQVGGKWYYVNTSGELAVNTSIDGYRVNDNGEWVRZ

ID304

CTGAATACAAGTTTTGTTCATGCTGCTGATGGGATTCAATATGTCAGAGATGATACTAGAGATAAAGAAGAGGGAAT 
AGAGTATGATGACGCTGACAATGGGGATATTATTGTAAAAGTAGCGACTAAACCTAAGGTAGTAACCAAGAAAATTT
CAAGT ACGCGAATTCGTT ATGAAAAAG ATG AAACAAAAG ACCGT AGTG AAAAT CCTGTTACAATTGATGGAG AGG AT 
GGCTATGTAACTACGACAAGGACCTACGATGTTAATCCAGAGACTGGTTATGTTACCGAACAGGTTACTGTTGATAG 
AAAAG AAGCCACGG AT ACAGTTATCAAAGTTCCAGCT AAAAG CAAGGT TG AAGAAGTTCTTGTTCCATTTGCT ACTA 
AATATGAAGCAGACAATGACCTTTCTGCAGGACAGGAGCAAGAGATTACTCTAGGAAAGAATGGGAAAACAGTTACA 
ACGATAACTTATAATGTAGATGGAAAGAGTGGACAAGTAACTGAGAGTACTTTAAGTCAAAAAAAAGACTCTCAAAC
AAGAGTTGTTAAAAAAAGAACC^GCCCC^GTTCTTGTCCAAGAAATTCC^ATCGAAACAGAATATCTCGATGGCC 
CAACTCTTGATAAAAGTCAAGAAGTAGAAGAAGTAGGAGAAATTGGTAAATTACTCTTACTACAATCTATACTGTAG 

LNTS FVHAADG I Q YVRDDTRD KEEG I E YDDADNGD 1 1 VKV AT KP KWT KKISSTRIRYEKDETKDRSEN P VT I DG ED 
GYVTTTRTYDVNPETGYVTEQVTVDRKEATDTVIKVPA

TITYNVDGKSGQVTESTLSQKKDSQTRWKKRTKPQVLVQEIPIETEYLDGPTLDKSQEVEEVGEIGKLLLLQSILZ 



ID305 

ATGAAGCTTTTGAAAAAAATGATGCAAATCGCACTAGCCACATTTTTCTTCGGTTTGTTAGCGACAAATACAGTATT
TGCAGATGATTCTGAAGGATGGCAGTTTGTCCAAGAAAATGGTAGAACCTACTACAAAAAGGGGGATCTAAAAGAAA 
CCTACTG GAGAGTGATAGATGGG AAGT ACT ATT ATTTTG ATC CTTTATCCGG AG AGATGGTTGTCGG CTGGCAATAT 
AT AC CTG CT CC ACACAAGGGGGTTACGATTGGTC CT^ CT CC AAGAAT AGAGATTGCTCTTAGACCAGATTGGTTTTA 
TTTTGG TC AAG ATGGTGTATTAC AAG AATTTGTTGGCAAG CAAGTTTTAG AAGC AAAAACTG CT ACGAAT ACCAACA 

AACATCATGGGGAAGAATATGATAGCCAAGCAGAGAAACGAGTCTATTATTTTGAAGATCAGCGTAGTTATCATACT
TTAAAAACTGGTTGGATTTATGAAGAGGGTCATTGGTATTATTTACAGAAGGATGGTGGCTTTGATTCGCGCATCAA
CAGATTGACGGTTGGAG AGCTAGCACGTGGTTGGGTTAAGG ATT ACCCT CTT ACGTATGATGAAG AGAAG CTAAAAG 
CAGCTCCATGGTACTATCTAAATC CAG CAACTGGC ATT ATGCAAACAGGTTGGCAAT ATCTAGGTAAT AG ATGGTAC 
TACCTCCATTCGTCAGGAGCTATGGCAACTGGCTGGTATAAGGAAGGCTCAACTTGGTACTATCTAGATGCTGAAAA 
TGGTGATATGAGAACTGGCTGGCAAAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCTATGGCAACTC
GTTGGTATCAGGAAAGTTCGACTTGGTACTATCTAAATGCAAGTAATGGAGATATGAAAACAGGCTGGTTCCAAGTC 
AATGGTAACTGGTACTATGCCTATGATTCAGGTCCTTTAGCTGTTAATACCACAGTAGGTGGTTACTACTTAAACT 
TAATGGTGAATGGGTTAAGTAA 

MKLLKKMMQIAIATFFFGLLATNTVFADDSEGWQFVQENGRTYYKK^

IPAPHKGVTIGPSPRIEIALRPDWFYFGQDGVLQEFVGKQVLEAKTATNTNKHHGEEYDSQAEKRVYYFEDQRSYHT 
LKTGWIYEEGHWYYLQKDGGFDSRINRLTVGELARGWVKDYPLTYDEEKLKAAPWYYLNPATGIMQTGW 
YI^SSGAMATGWYKEGSTWYYLDAENGDMRTGWQNI^NKWYYLRSSGAMATGWYQESSTWYYLNASNGDMKTGW 
NGNWYYAYDSGALAVNTTVGGYYLNYNGEWVKZ 

ID306 

TTGGCTGGTAGATATGGTTCTGCTGTTCAGTGTACAGAAGTGACTGCCTCAAACCTTTCAACAGTTAAAACTAAAGC 
TACGGTTGTAGAAAAACCACTGAAAGATTTTAGAGCGTCTACGTCTGATCAGTCTGGTTGGGTGGAATCTAATGGTA 
AATGGTATTTCTATGAGTCTGGTGATGTGAAGACAGGTTGGGTGAAAACAGATGGTAAATGGTACTATTTGAATGAC 
TTAGGTGTCATGCAGACTGGATTTGTAAAATTTTCTGGTAGCTGGTATTACTTGAGCAATTCAGGTGCTATGTTTAC
AGGCTGGGGAACAGATGGTAGCAGATGGTTCTACTTTGACGGCTCAGGAGCTATGAAGACAGGCTGGTACAAGGAAA 
ATGGC ACTTGGT ATTAC CTTG ACGAAG CAGG T ATCATGAAG ACAGGTTGGTTTAAAGT CGGACC AC ACTGGTACTAT 
GCCTACGGTTCAGGAGCTTTGGCTGTGAGCACAACAACACCAGATGGTTACCGTGTAAATGGTAATGGTGAATGGGT 
AAACTAG 

LAGRYGSAVQCTEVTASNLSTVKTKATWEKPLKDFRASTSDQSGOTESNGKVW 

LGVMQTGFVKFSGSWYYLSNSGAMFTGWGTDGSRWFYFDGSGAMKTGWYKENGTWYYLDEAGIMKTGWFKVGPHWYY 
A YGS G AliAVS TTT PDG YRVNGNGEWVN Z 

ID307

ATGAAAATTTTGAAAAAAACTATGCAAGTTGGACTGACAGTATTTTTCTTTGGTTTGCTAGGGACCAGT^ 
T GC AG ATG ATT CTGAAGG ATGG CAGT T TG T C CAAG AAAACGG AAG AAC CT AC T AC AAAAAGGGGG AC CT CAAAGAAA 
C CTACTGG CG AGTGATTGATGGT AAGTACTATT ATTTTG ATT CT CTATCTGG AGAGATGGTTGTCGG CTGGCAATAT 
ATC C CGTTTC CATCTAAAGGT AGTACAATTGGT CCTTACCCAAATGGT ATCAGATTAGAAGGTTTT CCAAAGTCAGA 

GTGGTACTACTTCGATAAAAATGGAGTGCTACAAGAGTTTGTTGGTTGGAAAACATTAGAGATTAAAACTAAAGACA
GTGTTGGAAGAAAGTACGGGGAAAAACGTGAAGATTCAGAAGATAAAGAAGAGAAGCGTTATTATACGAACTATTAC 
TTTAATCAAAATCATTCTTTAGAGACAGGTTGGCTTTATGATCAGTCTAACTGGTATTATCTAGCTAAGACGGAAAT 
TAATGGAGAAAACTACCTTGGTGGTGAAAGACGTGCGGGGTGGATAAACGATGATTCGACTTGGTACTACCTAGATC 
CAACAACTGGTATTATGCAAACAGGTTGGCAATATCTAGGTAATAAGTGGTACTACCTCCGTTCCTC^ 

GCCACTGGCTGGTATCAGGAAGGTACCACTTGGTATTATTTAGACCACCCAAATGGCGATATGAAAACAGGTTGGCA

AAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCTATGGCAACTGGTTGGTATCAAGATGGTTCAACTT 
GGTACTACCTAAATGCAGGTAATGGAGACATGAAGACAGGTTGGTTCCAGGTCAATGGCAACTGGTACTATGCTTAT 
AGCTCAGGTGCTTTGGCAGTGAATACGACCGTAGATGGCTATTCTGTCAACTATAATGGCGAATGGGTTCGGTAA 

MKILKKTMQVGLTVFFFGLLGTSTVFADDSEGWQFVQENGRTYYKKGDLKETYWRVIDGKYYYFDSLSGEMVVGWQY
IPFPSKGSTIGPYPNGIRLEGFPKSEWYYFDKNGVLQEFVGWKTLEIKTKDSVGRKYGEKREDSEDKEEKRYYTNYY 
FNQbWSLETGWLYDQSNWYYLAKTEINGENYLGGERRAGWINDDSTWYYLDPTTGIMQTGWQY 
ATGWYQEGTTWYYLDHPNGDMKTGWQNLGNKMYYLRSSGAMATGWYQDGSTWYYLNAGNGDMKTGWFQVNGNWYYAY 
SSGALAVNTTVDGYSVNYNGEWVRZ 

ID308

ATGAAAAAG AAATTAACTAGTTT AGCACTTGTAGG CG CTTTTTTAGGTTTGTCATGGTATGGGAATGTTCAGGCTCA 
AGAAAGTT GAGGAAAT AAAATC CACTTTATC AATGTTCAAGAAGGTGGCAGTGATGCGATTATTCTTGAAAGCAATG 

GACATTTTGCCATGGTGGATACAGGAGAAGATTATGATTTCCCAGATGGAAGTGATTCTCGCTATCCATGGAGAGAA
GGAATTGAAACGTCTTATAAGCATGTTCTAACAGACCGTGTCTTTCGTCGTTTGAAGGAATTGGGTGTCCAAAAACT 
TGATTTT ATTTTGGTGACC C AT ACCCAC AGTG AT CATATTGG AAATGTTGATGAATT AC TGTCTACCT ATCCAGTTG 
ACCGAGTCTATCTTAAGAAATATAGTGATAGTCGTATTACTAATTCTGAACGTCTATGGGATAATCTGTATGGCTAT 
GAT AAGGTTTT ACAG ACTGCTG C AGAAAAAGGTGTTTCAGTT ATT CAAAAT ATCAC AC AAGGGGATG CTCATTTTCA 

GTTTGGGGACATGGATATTCAGCTCTATAATTATGAAAATGAAACTGATTCATCGGGTGAATTAAAGAAAATTTGGG
ATGACAATTCCAATTCCTTGATTAGCGTGGTGAAAGTCAATGGCAAGAAAATTTACCTTGGGGGCGATTTAGATAAT
GTTCATGGAGCAGAAGACAAGTATGGTCCTCTCATTGGAAAAGTTGATTTGATGAAGTTTAATCATCACCATGATAC 
CAACAAATCAAATACCAAGGATTTCATTAAAAATTTGAGTCCGAGTTTGATTGTTCAAACTTCGGATAGTCTACCTT 
GGAAAAATGGTG TTG AT AGTG AGT ATGT TAAT TGGCTCAAAG AACGAGGAATTGAG AGAATCAACG CAGCC AGCAAA 
GACTATGATGCAACAGTTTTTGATATTCGAAAAGACGGTTTTGTCAATATTTCAACATCCTACAAGCCGATTCCAAG 
TTTTCAAGCTGG TTGGC AT AAGAGTG C AT ATGGGAACTGGTGGT AT CAAGCGCCTGATTCT AC AGGAGAGTATGCTG 
T CGGTTGGAATG7VAATCG AAGGTGAATGGT ATTACTTT AACCAAACGGGTATCTTGTT AC AG AAT CAATGG AAAAAA 
TGGAACAATCATTGGTTCTATTTGACAGACTCTGGTGCnTCTGCTAAAAATTGGAAGAAAATCGCTGGAATCTGGTA 
TTATTTTAACAAAGAAAACCAGATGGAAATTGGTTGGATTCAAGATAAAGAGCAGTGGTATTATTTGGATGTTGATG 
GTTCTATGAAGACAGG ATGGCTTCAATATATGGGGCAATGGT ATT ACTTTGCT CCATCAGGGG AAATGAAAATGGG C 
TGGGTAAAAGATAAAGAAACCTGGTACTATATGGATTCTACTGGTGTCATGAAGACAGGTGAGATAGAAGTTGCTGG 
TCAACATTATTATCTGGAAGATTCAGGAGCTATGAAGCAAGGCTGGCATAAAAAGGCAAATGATTGGTATTTCTACA 
AGACAGACGGTTCACGAGCTGTGGGTTGGATCAAGGACAAGGATAAATGGTACTTCTTGAAAGAAAATGGTCAATTA 
CTTGTGAACGGTAAGACACCAGAAGGTTATACTGTGGATTCAAGTGGTGCCTGGTTAGTGGATGTTTCGATCGAGAA 
ATCTG CT AC AATTAAAACT ACAAGTC ATT CAG AAATAAAAG AAT CC AAAGAAGT AGTG AAAAAGGATCTTGAAAATA 
AAGAAACG AGT CAACATGAAAGTGTT AC AAATTTTTCAACT AGTC AAGATTTG AC ATCCT CAACTT CACAAAGCTCT 
GAAACGAGTGTAAACAAATCGGAATCAGAACAGTAG 

MKKKLTSLALVGAFLGLSWYGNVQAQESSGNKIHFINVQEGGSDAIILESNGHFAMVDTGEDYDFPDGSDSRYPWRE 

GIETSYKHVLTDRVFRRLKELGVQKLDFILVTHTHSDHIGNVDELLSTYPVDRVYLKKYSDSRITNSERLWDNLYGY 

DKVLQTAAEKGVSVIQNITQGDAHFQFGDMDIQLYOTENETDSSGELKKIWDDNSNSLISWKVNGKKIYLGGDLDN 

VHGAEDKYGPLIGKVDLMKFNHHHDTNKSNTKDFI KNLS PSLI VQTSDSLPWKNGVDSEYVNWLKERGI ERINAASK 

DYDATVFDIRKDGFVNISTSYKPIPSFQAGWHKSAYGNWWYQAPDSTGEYAVGWNEIEGEWYYFNQTGILLQNQWKK 

WNNHWFYLTDSGASAKNWKKIAGIWYYFNKENQMEIGWIQDKEQWYYLDVDGS 

WVKDKETWYYMDSTGVMKTGEIEVAGQHYYLEDSGAMKQGWHKKA 

LVNGKTPEGYTVDSSGAWL.VDVSIEKSATIKTTSHSEIKESKEWKKDLENKETSQHESVTNFSTSQDLTSSTSQSS 
ETSVNKSESEQZ 

ID309 

ATGG AAATTAATGTGAGT AAATTAAG AACAG ATTTGCCTCAAGTCGGCGTGCAACCAT AT AGG CAAGTACACG CACA 
CTCAACTGGGAATCCGCATTCAACCGTACAGAATGAAGCGGATTATCACTGGCGGAAAGACCCAGAATTAGGTTTTT 
TCTCGCACATTGTTGGGAACGGTTGCATCATGCAGGTAGGACCTGTTGATAATGGTGCCTGGGACGTTGGGGGCGGT 
TGGAATGCTGAGACCTATGCAGCGGTTGAACTGATTGAAAGCCATT(^CCAAAGAAGAGTTCATGACGGACTACCG 
CCTTTATATCGAACTCTTACGCAATCTAGCAGATGAAGCAGGTTTGCCGAAAACGCTTGATACAGGGAGTTTAGCTG 
GAATTAAAACGCACGAGTATTGCACGAATAACCAACCAAACAACCACTCAGACCACGTTGACCCTTATCCATATCTT 
GCTAAATGGGGCATTAGCCGTGAGCAGTTTAAGCATGATATTGAGAACGGCTTGACGATTGAAACAGGCTGGCAGAA 
G AATGACACTGGCTACTGGTACGTACAT TCAGACGG CTCTT ATCCAAAAGACAAGTTTGAGAAAATCAATGG CACTT 
GGTACTACTTTG ACAGTTCAGGCTATATGCTTGCAGACCG CTGGAGG AAG CACACAGACGG CAACTGGTACTGGTTC 
GACAACTC AGGCGAAATGGCTACAGG CTGG AAGAAAATCGC TG AT AAGTGGTACTATTTCAACGAAGAAGGTGCCAT 
G AAG ACAGGCTGGGTC AAGT ACAAGGAC ACTTGGT ACT ACTT AGACG CTAAAGAAGGCGCC ATGGTAT CAAATGC CT 
TTATCCAGTCAGCGGACGGAACAGGCTGGTACTACCTCAAACCAGACGGAACACTGGCAGACAAGCCAGAATTCACA 
GTAGAGCCAGATGGCTTGATTACAGTAAAATAA 

ME I NVS KLRTD L PQ VGVQ P YRQVHAHS TGN PHS TVQN E AD YHW RKDPELGFFSHI VGNGC I MQ VG P VDNG AWD VGGG 
WNAETYAAVELIESHSTKEEFMTDYRLYIELLRNLADEAGLPKTLDTGS^^ 

AKWGISREQFKHDIENGLTIETGWQKNDTGYWYVKSDGSYPKDKFEKINGTWYYFDSSGYMLADRWRKHTD 

DNSGEMATGWKKIADKWYYF^EEGAMKTGWKYKDTWYYLDAKEGAMVSNAFIQSADGTGWYYLKPDGTI*ADKPEFT 

VEPDGLITVKZ 

ID310 

ATGGGCACAACAGGATTTACAATAATTGACTTAATTATCTTGATTGTTTATTTACTTGCGGTGTTGGTTGCAGGTAT 
CTATTTCTCTAAAAAAG AG ATG AAAGG AAAAG AGTT CTTT AAAGGAGATGGTTCGGTTCCTTGGT ATGTTACT TCX3G 
TATCCATTTTTGCCACAATGCTCAGTCCGATTTCCTTCTTGGGACTCGCTGGTAGCTCTTATGCAGGTAGCTGGATT 
TTATGGTTTG CT CAATT AGGG ATGGT AGTAG CT ATTCCACTGACAATTCGTTTTATCTTACCT AT CTTTG CACGGAT 
AGACATCGATACGGCATATGATTACTTGGATAAACGTTTTAATTCTAAAGCACTTCGTATTATTTCAGCACTCTTGT 
TT ATT AT T TAT CAATTGGGACGTATGTCT ATC AT T ATGT ACCTCCC ATCAGCTGGTTTATCAGT ATTG ACAGG AATT 
GACATCAATATTTTGATTATTTTGATGGGTGTAGTTGCAATTGTTTATTCTTATACTGGTGGTCTAAAATCCGTATT 
ATGGACAGACTTTATTCAAGGTGTGATTCTGATTAGTGGTGTCGTTTTAGCTTTATTTGTACTGATTGCTAATATTA 
AAGGTGGCTTTGGTGCAGTAGCAGAAACATTAGCAAACGGGAAATTCCTTGCTGCAAATGAAAAACTTTTCGATCCT 
AACTTGCTTTCAAACTCCATCTTTTTAATTGTGATGGGTTCAGGCTTTACAATCTTGTCTTCCTATGCTTCATCTCA 
AGATTTGGTTCAACGTTTTACTAC^ACACAAAATATTAAGAAACTTAATAAGATGTTGTTCACAAACGGTGTTTTGT
CACTTGCAACTGCAACAGTCTTTTACTTGATTGGTACAGGCTTGTACGTATTCTATCAAGTACAAAATGCAGATAGT 
GCAGCTAGCAATATCCCTCAAGACCAAATCTTTATGTACTTTATTGCATACCAGTTACCAGTAGGTATCACAGGTTT 
GATCT TGGCAG CG ATTTATGC AGCATCTCAATC AACT ATTTCAACAGGTTTGAACTCTCTTGCAACTTCATGGACAT 
TGGATATTCAAGATGTCATTTCTAAAAATATGTCAGACAATCGTCGTACGAAAATTGCACAATTCGTATCTCTAGCA
GTAGGTTTATTCTCAATTGGTGTTTCCATTGTCATGGCTCACTCAGATATTAAATCTGCATACGAAT 
TTTCATGGGACTTGTACTTGGTCTACTTGGTGGTGTATTTATTCTTGGATTTGTTTCTAAAAAAGCAAATAAACAAG 
GTGCTTATGCAGCGCTGATTGT ATCAACCAT CGT CATGGT ATTTATTAAATACTTCCTTC CTCCAAC AGCTGTTAGC 
TACTGGGCATATTCATTGATTTCAATCTCTGTATCAGTAGTTTCAGGTTATATTGTATCTGTTCTTACTGGAAATAA 
AGTATCTGCACCTAAATATACAACGATTCATGATATTACAGAAATTAAAGCGGATTCAAGTTGGGAAGTTCGTCACT
AA 

MGTTGFTI IDLI ILIVYLLAVLVAGI YFSKKEMKGKEFFKGDGSVPWYVTSVSI FATMLSPISFLGLAGSSYAGSWI 
LWFAQLGMWAIPLTIRFILPIFARIDIDTAYDYLDKRFNSKALRIISALLFIIYQLGRMSIIMYLPSAGLSVLTGI 
DINILIILMGVVAIVYSYTGGLKSVLWTDFIQGVILISGVVLALFVLIANIKGGFGAVAETLANGKFLAANEKLFDP
NLLSNS I FLI VMGSGFTILSSYASSQDLVQRFTTTQNI KKLNKMLFTNGVLSLATATVFYLIGTGLYVFYQVQNADS 
AASNIPQDQIFMYFIAYQLPVGITGLILAAIYAASQSTISTGLWSVATSWTLDIQDVISKNMSDNRRTKIAQFVSLA 
VGL F S I GVS I VMAHS D I KS A Y EW FNS FMGL VLG L LGG V F I LG FVS KKANKQG A Y AAL I VST I VMVF I KY FL P PTAVS 
YWAYS L I S I SVS WSGY I VSVLTGNKVS APKYTT I HD I TE I KADS S WEVRHZ 

ID311 

ATGAAAATTAAT AAAAAAT ATCTAGCAGGTTCAGTGG C AGT C CTTGCCCTAAGTGTTTGTTCCTATGAG CTTGGTCG 
TCAC CAAGCTGGTCAGG ATAAGAAAGAGTCTAATCGAGTTG CTTATAT AGATGGTG AT CAGGCTGGTCAAAAGGCAG 
AAAACTTGACACCAGATGAAGTCAGTAAGAGGGAGGGGATCAACGCCG AACAAATCGT CAT CAAGATTACGGATCAA 

GGTTATGTGACCTCTCATGGAGACCATTATCATTACTATAATGGCAAGGTCCCTTATGATGCCATCATCAGTGAAGA
GCTCCTCATGAAAGATCCGAATTATCAGTTGAAGGATTCAGACATTGTCAATGAAATCAAGGGTGGTTATGTCATCA 
AGGTAGACGG AAAATACTATGTTTACCTT AAGGATGC AG C TCATG CGG AT AATATTCGGACAAAAGAAGAGATTAAA 
CGTCAGAAG CAGGAACG CAGTCATAAT CACGGGTCAGG AGCTAACGATCATGCAGTAGCTGCAGCCAGAGCC CAAGG 
ACGCTATACAACGGATGATGGGTATATCTTCAATGCATCTGATATCATTGAGGACACGGGTGATGCTTATATCGTTC 

CTCACGGCGACCATTACCATTACATTCCTAAGAATGAGTTATCAGCTAGCGAGTTAGCTGCTGCAGAAGCCTATTGG
AATGGGAAGCAGGGATCTCGTCCTTCTTCAAGTTCTAGTTATAATGCAAATCCAGCTCAACCAAGATTGTCAGAGAA 
CCACAATCTGACTGTCACTCCAACTTATCATCAAAATCAAGGGGAAAACATTTCAAGCCTTTTACGTGAATT 
CTAAACCCTTATCAGAACGCCATGTGGAATCTGATGGCCTTATTTTCGACCCAGCGCAAATCACAAGTCGAACCGCC 
AGAGGTGTAGCTGTCCCTCATGGTAACCATTACCACTTTATCCCTTATGAACAAATGTCTGAATTGGAAAAACGAAT 

TGCTCGTATTATTCCCCTTCGTTATCGTTCAAACCATTGGGTACCAGATTCAAGACCAGAACAACCAAGTCCACAAT

CGACTCCGGAACCTAGTCCAAGTCCGCAACCTGCACCAAATCCTCAACC^GCTCCAAGCAATCCAATTGATGAGAAA 
TTGGTCAAAGAAGCTGTTCGAAAAGTAGGCGATGGTTATGTCTTTGAGGAGAATGGAGTTTCTCGTTATATCCCAGC 
CAAGG ATCTTTCAGCAGAAACAGCAG CAGGCATTGAT AG CAAACTGGCCAAGCAGGAAAGTTTATCTCAT AAGCT AG 
GAGCTAAGAAAACTGACCTCCCATCTAGTGATCGAGAATTTTACAATAAGGCTTATGACTTACTAGCAAGAATTCAC 

CAAGATTTACTTGATAATAAAGGTCGACAAGTTGATTTTGAGGCTTTGGATAACCTGTTGGAACGACTCAAGGATGT
CCCAAGTGATAAAGTCAAGTTAGTGGATGATATTCTTGCCTTCTTAGCTCCGATTCGTCATCCAGAACGTTTAGGAA 
AACCAAATGCGCAAATTACCTACACTGATGATGAGATTCAAGTAGCCAAGTTGGCAGGCAAGTACACAACAGAAGAC 
GGTTATATCTTTGATCCTCGTGATATAACCAGTGATGAGGGGGATGCCTATGTAACTCCACATATGACCCATAGCCA 
CTGGATT AAAAAAG ATAGTTTGTCTG AAGCTG AG AG AG CGG C AG C CCAGGCTT ATGCT AAAG AG AAAGGTTTG ACC C 

CTCCTTCGACAGACCATCAGGATTCAGGAAATACTGAGGCAAAAGGAGCAGAAGCTATCTACAACCGCGTGAAAGCA
GCTAAGAAGGTGCCACTTGATCGTATGCCTTACAATCTTCAATATACTGTAGAAGTCAAAAACGGTAGTTTAATCAT 
ACCTCATTATGACCATTACCATAACATCAAATTTGAGTGGTTTGACGAAGGCCTTTATGAGGCACCTAAGGGGTATA 
CTCTTGAGGATCTTTTGGCGACTGTCAAGTACTATGTCGAACATCCAAACGAACGTCCGCATTCAGATAATGGTTTT 
GGT AACGCTAGCG ACCATG TTCAAAG AAAC AAAAATGGT C AAG CTGAT ACCAATCAAACGGAAAAACCAAGCG AGG A 

GAAACCTCAGACAGAAAAACCTGAGGAAGAAACCCCTCGAGAAGAGAAACCGCAAAGCGAGAAACCAGAGTCTCCAA

AACCAACAGAGGAACCAGAAGAATCACCAGAGGAATCAGAAGAACCTCAGGTCGAGACTGAAAAGGTTGAAGAAAAA 
CTGAGAGAGGCTGAAGATTTACTTGGAAAAATCCAGGATCCAATTATCAAGTCCAATGCCAAAGAGACTCTCACAGG 
ATTAAAAAATAATTTACTATTTGGCACCCAGGACAACAATACTATTATGGCAGAAGCTGAAAAACTATTGGCTTTAT 
TAAAGGAGAGTAAGTAA 

MKINKKYLAGSVAVLALSVCSYEIX3RHQAGQDKKESNRVAYIDGDQAGQKAE 

GYVTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIVNEIKGGWIKVDGKYYVYLKDAAHADNIRTKEEIK 
RQKQERSHNHGSGANDHAVAAARAQGRYTTDDGYI FNASDI IEDTGDAYIVPHGDHYHYI PKNELSASELAAAEAYW 
NGKQGSRPSSSSSYNANPAQPRLSEtmNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTA 
RGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEQPSPQSTPEPSPSPQPAPNPQPAPSNPIDEK
LVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSKLAKQESLSHKLGAKKTDLPSSDREFYNKAYDLLARIH
QDLIJDNKGRQVDFEAIiDNLLERLKDVPSDKVKLVDDI 

GYI FDPRDI TSDEGDAYVTPHMTHSHWI KKDS LS EAERAAAQAYAKEKGLTPPSTDHQDSGNTEAKGAEAI YNRVKA 
AKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFEWFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGF 
GNASDHVQRNKNGQADTNQTEKPSEEKPQTEKPEEETPREEKPQSEKPESPKPTEE PEES PEES EE PQVETEKVEEK 
LREAEDLLGKI QDPII KSNAKETLTGLKNNLLFGTQDNNT I MAEAEKLLALLKES KZ 

ID312 

ATGGAGGG ATTGGTT AG AGTGC ATTTATTG CCTGT ATTTGGCG ATT ACAAGC TATCT AAACTTACTACGCCTATT CT 
TCAACAG CAAGTAAACAAATGGGCTGACAAGGC AAATAAAGGCGAAAAAGGGGCATTTGCT AACTACTCTTTGCT CC 
ATAACATGAATAAGCGTATTTTGAAATATGGCGTAGCTATCCAGGTAATACAATACAACCCAGCTAATGATGTCATC 
GTTCCACGCAAACAGCAAAAAGAAAAGGCTGCTGTCAAATACTTAGACAACAAAGAATTAAAACAGTTTCTTGATTA 
TTTAGATGCTCTGGATCAATCAAATTATGAGAACTTATTTGATGTTGTTCTGTATAAGACTTTATTGGCCACTGGTT 
GCCGTATT AGTGAGGCTCTGGCTCTTGAATGGT CTGAT AT TGACCTAGAAAG CGGTGTTAT CAGCATCAAT AAGACA 
CTAAACCGCTATCAGGAAATAAACTCACCTAAATCAAGCGCTGGTTATCGTGATATACCAATAGACAAAGCCACATT 
ACTTTT ACTGAAACAATACAAAAACCGTCAAC AAATT CAG TCTTGG AAATTAGG CCG ATCTGAAACAGTTGTATTCT 
CTGTATTTACGGAGAAATATGCTTATGCTTGTAACTTACGCAAACGCCTAAATAAGCATTTTGATGCTGCTGGAGTA 
ACT AACGT ATCATTTCATGGTTTCCGCCATACACATACT ACTATGATGCT C TATGCT CAGGTTAGCCCGAAAG ATGT 
TC AGT ATAGATT AGG CCACTCT AATTTAATGAT C ACTG AAAAT ACTT ACTGG C ATACTAACCAAGAG AATGC AAAAA 
AAGCCGTCTCAAATTATGAAACAGCTATCAACAATTTATAA 

MEGLVRVHLLPVFGDYKLSKLTTP I LQQQVNKWADKANKGEKGAFANYSLLHNMNKRI LKYGVAI QVIQYNPANDVI 
VPRKQQKEKAAVKYLDNKELKQFLDYLDALDQSNYENLFDWLYKTLLATGCRISEALAIjEWSDIDLESGVISIN^ 
LNRYQEINSPKSSAGYRDIPIDKATLLLLKQYKNRQQIQSWKLGRSETWFSVFTEKYAYAO^LRKRLNKHFDAAGV 
TNVSFHGFRHTHTTMMLYAQVSPKDVQYRLGHSNLMITENTYWHTNQENAKKAVSNYETAINNLZ 
CLAIMS:

1. A Streptococcus pneumoniae protein or polypeptide having a sequence
selected from those shown in Table 2.

2. A Streptococcus pneumoniae protein or polypeptide having a sequence 
selected from those shown in Table 4.

3. A protein or polypeptide as claimed in claim 1 or claim 2 provided in 
substantially pure form.

4. A protein or polypeptide which is substantially identical to one defined in any 
one of claims 1 to 3. 

5. A homologue or derivative of a protein or polypeptide as defined in any one of
claims 1 to 4. 

6. An antigenic and/or immunogenic fragment of a protein or polypeptide as
defined in Tables 2-4.

7. A nucleic acid molecule comprising or consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 1 or their RNA equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 
(iv) a sequence which is substantially identical with any of those of (i), (ii)
and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a 
protein as defined in Table 1.

8. A nucleic acid molecule comprising or consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 4 or their RNA equivalents; 



(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

(iv) a sequence which is substantially identical with any of those of (i), (ii) 
and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a 
protein as defined in Table 4. 

9. The use of a protein or polypeptide having a sequence selected from those 
shown in Tables 2-4, or homologues, derivatives and/or fragments thereof, as an 
immunogen and/or antigen. 



10. An immunogenic and/or antigenic composition comprising one or more 
proteins or polypeptides selected from those whose sequences are shown in Tables
2-4, or homologues or derivatives thereof, and/or fragments of any of these.

11. An immunogenic and/or antigenic composition as claimed in claim 10 which 
is a vaccine or is for use in a diagnostic assay. 

12. A vaccine as claimed in claim 11 which comprises one or more additional
components selected from excipients, diluents, adjuvants or the like. 

13. A vaccine composition comprising one or more nucleic acid sequences as 
defined in Tables 1, 3 or 4.

14. A method for the detection/diagnosis of S. pneumoniae which comprises the 
step of bringing into contact a sample to be tested with at least one protein or 
polypeptide as defined in Tables 2-4, or homologue, derivative or fragment thereof. 

15. An antibody capable of binding to a protein or polypeptide as defined in 
Tables 2-4, or to a homologue, derivative or fragment thereof.

16. An antibody as defined in claim 15 which is a monoclonal antibody. 

17. A method for the detection/diagnosis of S. pneumoniae which comprises the step
of bringing into contact a sample to be tested and at least one antibody as defined in
claim 15 or claim 16.

18. A method for the detection/diagnosis of S. pneumoniae which comprises the
step of bringing into contact a sample to be tested with at least one nucleic acid 
sequence as defined in claim 7 or claim 8. 
19. A method of determining whether a protein or polypeptide as defined in
Tables 2-4 represents a potential anti-microbial target which comprises inactivating
said protein or polypeptide and determining whether S. pneumoniae is still viable in
vitro or in vivo. 

20. The use of an agent capable of antagonising, inhibiting or otherwise 
interfering with the function or expression of a protein or polypeptide as defined in 
Tables 2-4 in the manufacture of a medicament for use in the treatment or 
prophylaxis of S. pneumoniae infection.
MSDRTIGILGLGIFGSSVLAALAKQDMNUAIDDHAERINQFEPVLARGVIGDITDEELLRSAGIDTCDTVVVATGENLESS
VLAVMHCKSLGVPTVIAKVKSQTAKKVLEKIGADSVISPEYEMGQSLAQTILFHNSVDVFQLDKNVSIVEMKIPQSWAG 
QSLSKLDLRGKYNLNILGFREQENSPLDVEFGPDDLLKADTYILAVINNQYLDTLVALNSZ 

MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVRAIYQENKCHGGAVNRGLVEAS
GRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFVTNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSK
GQYTMMHSLIYRTDLLRASQFZ



MKFNPNQRYTRWSIRRLSVGVASVVVASGFFVLVGQPSSVRADGLNPTPGQVLPEETSGTKEGDLSEKPGDTVLTQAKP 
EGVTGNTNSLPTPTERTEVSEETSPSSLDTLFEKDEEAQKNPELTDVLKETVDTADVDGTQASPAETTPEQVKGGVKEN 
TKDSIDVPAAYLEKAEGKGPFTAGVNQVIPYELFAGDGMLTRLLLKASDNAPWSDNGTAKNPALPPLEGLTKGKYFYE 
VDLNGNTVGKQGQALIDQLRANGTQTYKATVKVYGNKDGKADLTNLVATKNVDININGLVAKETVQKAVADNVKDS 
IDVPAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGENVKTKGQYFYQV
ALDGNVAGKEKQALIDQFRANGTQTYSATVNVYGNKDGKPDLDNIVATKKVTININGUSKETVQKAVADNVKDSIDV 
PAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGENVKTKGQYFYQLALD 
GNVAGKEKQALIDQFRANGTQTYSATVNVYGNKDGKPDLDNIVATKKVTININGLISKETVQKAVADNVKTVSMFQQP 
TZ 

MKLKSYILVGYIISTLLTILVVFWAVQKMLIAKGEIYFLLGMTIVASLVGAGISLFLLLPVFTSLGKLKEHAKRVAAKDFP 
SNLEVQGPVEFQQLGQTFNEMSHDLQVSFDSLEESEREKGLMIAQLSHDIKTPITSIQATVEGrLDGIIKESEQAHYLATIG 
RQTERLNKLVEELNFLTLNTARNQVETTSKDSIFLDKLLIECMSEFQFLIEQERRDVHLQVIPESARIEGDYAKLSRILVN 
LVDNAFKYSAPGTKLEVVAKLEKDQLSISVTDEGQGIAPEDLENIFKRLYRVETSRNMKTGGHGLGLAIARELAHQLGG
EITVSSQYGLGSTFTLVLNLSGSENKAZ 

MFGQTAQHGLTNSLKDFWIFLLNIGPQLAFFCQMLRCSRSVEQGTGNHRREFNMIQQIFSHFGMTHLGQIKLVYQESID 
LELLVNALNHHLLIDRLVLTPNQITIEIDRQIVHGLDLLKGRKDKEUDIICSMFRQLELASTQQICPINQRVHHGILAFGEIS
DLVPAKNLPNRQDZ 

MEHLATYFSTYGGAFFAALGIVLAVGLSGMGSAYGVGKAGQSAAALLKEQPEKFASALILQLLPGTQGLYGFVIGILIW 
LQLTPELPLEKGVAYFFVALPUIVGYFSAKHQGNVAVAGMQILAKRPKEFMKGAILAAMVETYAILAFVVSFILTLRVZ

MLKSEKQSRYQMLNEELSFLLEGETNVLANLSNASALIKSRFPNTVFAGFYLFDGKELVLGPFQGGVSCIR1ALGKGVC 
GEAAHFQETVP/GDVTTYLNYISCDSLAKSEIVVPMMKNGQLLGVLDLDSSEIEDYDAMDRDYLEQFVAILLEKTAWD 
FTMFEEKSZ

MSVLEIKDLHVEIEGKEILKGVNLTLKTGEIAAIMGPNGTGKSTLSAAIMGNPNYEVTKGEVLFDGVNILELEVDERAR 
MGLFLAMQYPSEIPGITNAEFLRAAMNAGKEDDEKISVREFITKLDEKMELLNMKEEMAERYLNEGFSGGEKKRNEIL 
QLLMLEPTFALLDEIDSGLDIDALKVVSKGVNAMRGEGFGAMIITHYQRLLNYITPDVVHVMMEGRVVLSGGPELAAR
LEREGYAKLAEELGYDYKEELZ

MPYKRQRSFSMALSKLDSLYMAWADHSKNPHHQGKLEDAEQISLNNPTCGDVINLSVKFDAEDRLEDIAFLNSGCTIS 
TASASMMTDAVLGKTKQEILELATIFSEMVQGQKDERQDQLGDAAFLSGVAKFPQRIKCATLAWNALKKTIENQEKQZ

MKIQDLLRKDVMLLDLQATEKTAVIDEMIKNLTDHGYVTDFETFKEGILAREALTSTGLGDGIAMPHSKNAAVKEATV 
LFAKSNKGVDYESLDGQATDLFFMIAAPEGANDTHLAALAELSQYLMKDGFADKLRQATSADQVIELFDQASEKTEEL
VQAPANDSGDFIVAVTACTTGIAHTYMAQEALQKVAAEMGVGIKVETNGASGVGNQLTAEDIRKAKAIIIAADKAVEM
DRFDGKPLINRPVADGIRKTEEUNLALSGDTEVYRAANGAKAATASNEKQSLGGALYKHLMSGVSQMLPFVIGGGIMI 
ALAFLIDGALGVPNENLGNLGSYHELASMFMKIGGAAFGLMLPVFAGYVAYSIAEKPGLVAGFVAGAIAKEGFAFGKIP 
YAAGGEATSTLAGVSSGFLGALVGGFIAGALVLAIKKYVKVPRSLEGAKSILLLPLLGTILTGFVMLAVNIPMAAINTAM 
NDFLGGLGGGSAVLLGIVLGGMMAVDMGGPVNKAAYVFGTGTLAATVSSGGSVAMAAVMAGGMVPPLAIFVATLLF
KDKFTKEERNSGLTNIIMGLSFITEGAIPFGAADPARAIPSFILGSAVAGGLVGLTGIKLMAPHGGIFVIALTSNALLYLVS
VLVGAIVSGVVYGYLRKPQAZ 

MANKhrrSTTRRRPSKAELERKEAIQRMLISLGlAILLIFAAFKLGAAGn^YNLiRLLVGSU\YLAIFGLLIYLFFFKWI^ 
QEGLLSGFFTIFAGLLLIFEAYLVWKYGLDKSVLKGTMAQVVTDLTGFRTTSFAGGGLIGVALYIPTAFLFSNIGTYFIGS